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TRANSPOSON 



Field of the Invention 

This invention relates to a new type of transposon. The transposon may be 
5 used in methods for the identification of ess ential and conditional essential genes, in 
particular in bacteria. 

. Background to The Invention 

The increase in prevalence of antibiotic-resistant bacteria, for example, has 
10 - renewed interest in the search for new targets for antibacterial agents. Essential 
genes and in particular the proteins which they encode may be good substrates for 
• use in screens for ahtibacterials, antiparasitics, fungicides, pesticides and herbicides. 
Essential genes and their protein products potentially represent such targets. 

Additionally, there is an interest in the identification of conditional essential . 
1 5 genes, that is genes which are essential for the survival of an organism in a particular 
environment. In the case of pathogenic bacteria,' for example, conditional essential 
genes include those which are required for survival in a host. Such genes and the 
proteins which they encode may also be good targets for use in screens for 
antibacterials. In particular, bacteria which carry mutations in such genes may be 
20 useful in attenuated live vaccines. 

Summary of The Invention 

Essential genes are those genes which, when missing (eg. because of a 
chromosomal deletion) or mutated to render them non-functional, result in a lethal 

25 phenotype. That is, they are genes without which an organism' cannot survive. 

Conditional essential genes are those genes which, although not absolutely essential 
for the survival of an organism under all conditions, are essential for survival under . 
various conditional restraints. Examples of particular conditional restraints include 
survival at elevated temperatures, and survival of a pathogen within its host 

30 A number of transposon-based strategies have been developed for the 
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identification of essential and conditional essential genes. We have now devised a 
new set of transposons. The transposons can be used in a variety of methods for the 
identification of essential and conditional essential gene in the genome of an 
organism. 

5 According ito the invention there is thus provided a transposon which 

comprises an RNA polymerase recognition site and a homing endonuclease 
recognition site. 

The invention also provides: 

use of a transposon of the invention in a method for the identification of an 
• 10- essential or a conditional essential gene; 

a method for identifying an essential gene of an organism, which method 
comprises: * . 

(i) providing a library of transposon insertion mutants of the said 
organism, wherein the transposon is a transposon of the invention; 
1 5 • (ii) isolating chromosomal DNA from the library of (i); 

(iii) digesting the chromosomal DNA with a restriction endonuclease that 
* is capable of cutting 5' to the RNA polymerase recognition site(s) in the transposon 
and 3 ! to the RNA polymerase recognition site(s). in the chromosomal DNA flanking 
the transposon insertion site; 
20 • (iv) transcribing the resulting digested DNA from the RNA polymerase 

recognition site(s) in the said DNA; 

(v) hybridising the resulting RNA with an oligonucleotide array; and 

(vi) identifying a probe on the oligonucleotide array which corresponds to 
an essential gene of the organism; 

25 - a method for identifying a conditional essential gene of an organism, which 
method comprises: 

(a) providing a first sample of a library of transposon insertion mutants of 
the said organism (input library); 

(b) providing a second sample of the library and subjecting that sample to 
30 a conditional restraint; 




(c) collecting the mutants that survive the conditional restraint in step (ii) 
to give a second library (output library); 

(d) carrying out a method according to steps (ii) to (iv) of the method set 
out above on the input library from step (a) and on the output library from step (c); 

5 (e) hybridising the transcribed RNA derived from the input library and 

♦ from the output library separately to copies of the same oligonucleotide array or, if 
the RNA derived from the two libraries is differentially labelled, to the same 
oligonucleotide array; and 

(f) . identifying a probe on the oligonucleotide array (s) which corresponds 

10- - - to- a- conditional- essential gene of the organism; 

a method for identifying an essential gene of an organism, which method 
comprises: 

(i) . providing a library of transposon insertion mutants of the said 
organism, wherein the transposon is a transposon of the invention; 
1 5 (ii) isolating chromosomal DNA from the library of (i); 

(iii) digesting the chromosomal DNA with a restriction endonuclease that 
is capable of cutting 5 1 to the RNA polymerase recognition site(s) in the transposon 
and 3' to the RNA polymerase recognition site(s) in the chromosomal DNA flanking 
the transposon insertion site; 
20 (iv) transcribing the resulting digested DNA from the RNA polymerase 

recognition site(s) in the said DNA; 

(v) ? reverse transcribing the resulting RNA; 

(v) n hybridising the resulting cDNA with an oligonucleotide array; and 

(vi) identifying a probe on the oligonucleotide array which corresponds to 
25 an essential gene of the organism; 

a method for identifying a conditional essential gene of an organism, which 
method comprises: 

(a) providing a first sample of a library of transposon insertion mutants of 
the said organism (input library); 
30 (b) providing a second sample of the library and subjecting that sample to 



a conditional restraint; 

(c) collecting the mutants that survive the conditional restraint in step (ii) 
to give a second library (output library); 

(d) carrying out a method according to steps (ii) to (v)' of the method set 
5" duf atibvedn^ and"on the"output library- fromrstep(c); 

(e) hybridising the reverse transcribed cDNA derived from the input 
library and from the output library separately to copies of the same oligonucleotide 
array or, if the cDNA derived from the two libraries is differentially labelled, to the 
same oligonucleotide array; and 

1 o (f) identifying a probe on the oligonucleotide array(s) which corresponds 

to a conditional essential gene of the organism; 

use of an essential or conditional essential gene identified by a method as set 
out above, or a polypeptide encoded by a said gene, in a method for identifying an 
inhibitor of transcription and/or translation of that gene and/or activity of a 
1 5 polypeptide encoded that gene; 

a method for identifying an inhibitor of transcription and/or translation of an 
essential or conditional essential gene and/or an inhibitor of activity of a polypeptide 
encoded by a said gene, which method comprises: 

(a) identifying an essential or conditional essential gene by a method as 

20 set out above; and 

(b) determining whether a test substance can inhibit transcription and/or 

• translation of a gene identified in step (a) and/or activity' of a polypeptide encoded by 
a said identified gene, thereby to identify a said inhibitor; 

an inhibitor identified by such a method according to claim; 
25 - an inhibitor of the invention for use in a method of treatment of a bacterial, 
fungal or eukaryotic parasite infection, wherein the essential or conditional essential 
gene used to identify the inhibitor is a bacterial, fungal or eukaryotic parasite 
essential or conditional essential gene; 

use of such an inhibitor in the manufacture of a medicament for use in the 
30 treatment of abacterial, fungal or eukaryotic parasite infection; 
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a pharmaceutical composition comprising such an inhibitor and a 
pharmaceutically acceptable carrier or diluent; 

a method of treating a host suffering from a b arterial, fungal or eukary otic 
parasite infection, which method comprises the step of adrninisterihg to the host a 
5 therapeutically effective amount of such an inhibitor, 

a method for the preparation of a pharmaceutical composition, which method 

comprises: 

(a) identifying an inhibitor of transcription and/or translation of an 
essential or conditional essential gene of an organism and/or an inhibitor of activity 

10 . of a-polypeptide encoded by a said gene- by a method as set out above wherein the 
essential or conditional essential gene is a bacterial, fungal or eukaryotic parasite 
essential or conditional essential gene; and 

(b) formulating the inhibitor thus identified with a pharmaceutically 

acceptable carrier or diluent; 
15 - . a method for treating a host suffering from a bacterial, fungal or eukaryotic 
parasite infection, which method comprises: 

(a) identifying an inhibitor of transcription and/or translation of an 
essential or conditional essential gene of an organism and/or an inhibitor of activity 
of a polypeptide encoded by a said gene by a method as set out above wherein the 

20 essential or conditional essential gene is a bacterial, fungal or eukaryotic parasite 
essential or conditional essential gene; 

(b) formulating the inhibitor thus identified with a pharmaceutically 

acceptable carrier or diluent; and 

(c) acimimstering to the host a therapeutically effective amount of an 

25 inhibitor thus formulated; 

an inhibitor of the invention, wherein the essential or conditional essential 
gene is a plant bacterial, plant fungal or plant pest essential or conditional essential 
gene; 

us e of such an inhibitor as a plant bactericide, fungicide or pesticide; 
30 - an inhibitor of the invention, wherein essential or conditional essential gene is 



a plant essential or conditional essential gene; 
use of such an inhibitor as a herbicide; 

a bacterium attenuated by a non-reverting mutation in ons or more genes 
identified by a method of the invention; 
5 - * * a vaccine comprising such a bacterium and a pharmaceutically acceptable 
carrier or diluent; 

- a bacterium as described above for use in a method of vaccinating a human or 
animal; 

use of such a bacterium for the manufacture of a medicament for vaccinating 
10 a human or animal; 

a method for raising an immune response in. a mammalian host, which 
method comprises the step of administering to the host a bacterium as set out above; 
a method for preparing an attenuated bacterium, which method comprises: 

(a) identifying a conditional essential gene in a bacterium by a method of 
15 the invention; and 

(b) introducing a non-reverting mutation into a thus-identified conditional 
essential gene of the bacterium, thereby to attenuate the bacterium; 

a method for the preparation of a vaccine, which method comprises: 

(a) identifying a conditional essential gene in a bacterium by a method of 
20 the invention; - - . 

(b) introducing a non-reverting mutation into a thus -identified conditional 
essential gene of the bacterium, thereby to attenuate the bacterium; and 

(c) formulating the attenuated bacterium with a pharmaceutically 
acceptable carrier or diluent; and 

25 - a method for raising an immune response in a mammalian host, which 
method comprises:* 

(a) identifying a conditional essential gene in a bacterium by a method of 
the invention; 

(b) introducing a non-reverting mutation into a thus-identified conditional 
30 essential gene of the bacterium, thereby to attenuate the bacterium; 
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(c) formulatiiig the attenuated bacterium with a pharmaceutical^ 
acceptable carrier or diluent; and 

(d) administering to the host the attenuated bacterium thus formulated. 

5 Brief Description of The Figures 

Figure 1 shows a diagrammatic representation of the "Gene Kelly" transposon 
consisting of ME (mosaic end IS sequences), binding sites for the SP6 and T7 RNA 
polymerases, homing endonuclease sites for IScel and PI-P^pI, an R6k origin of 
replication and a kanamycin resistance cassette. 
- -10- - - Figure 2 sets out the sequence of the "Gene Kelly" transposon. 

Figure 3 shows a diagr amm atic representation of the original epicentre EZ:Tn 
R6k ori Kan transposon with the oligonucleotide sequences overlaid. Black boxes 
represent matching sequence between PCR oligonucleotides and the epicentre 
transposon. 

1 5 Figure 4 sets out a graph showing the position of insertion in the LT2 genome 

of the 46 sequenced transposon mutants (position in base pairs in increasing number 
order against number of transposons). From the graph it can be seen that there is a 
fairly random distribution of insertions throughout the genome. 

Figure 5 sets out a diagram of each end of the transposon showing the relative 

20 ' position of the iPCR oligonucleotides with the restriction endonuclease cut sites and 
RNA polymerase promoters in transposed chromosomal DNA, which has been 
digested and religated. 

Figure 6 sets out a schematic diagram of the ligation capture method of 
recovering the ends of the transposon. 

25 Figure 7 sets out array hybridisation results comparing input and output pool 

data. Array Images, panel a, reveals comparable sections of the scanned microarrays 
resulting from the hybridisation of the labeled RNA target generated from the input 
and output pool restricted DNA. Sets of probes corresponding to three transposon 
mutants (a set of probes refers to probes synthesised in both the sense and anti-sense 

30 directions around the point of transposon insertion. These are shown as horizontal 



r 



-8- 

lines above, sense probes, and below, anti-sense probes, the disrupted locus), 1 
(corresponding to probes encompassing a transposon insertion within the aroA 
gene), 2 (corresponding to probes encompassing a transposon insertion within gene 
X) and 3 (corresponding to probes encompassing a transposon insertion within an 
5 intergenic region) are boxed in the input panel and the data extracted from the input 
and output arrays are compared in the adjacent panel b, Extracted Data. In panel b, 
dashed vertical lines shown between the sense and anti-sense probes refer to the 
position of JRsal restriction endonuclease recognition sequences. Boxes above and 
below the black line indicate genes either in the sense or anti-sense direction, 
10 respectively, relative to the published LT2 genome sequence. 

Description of the sequence listing 

SEQ ID NO: 1 sets out the sequence of the "Gene Kelly 1 ' transposon. 

SEQ ID NO: 2 sets out the sequence of primer 97, which was used in the 
1 5 construction of the "Gene Kelly" transposon. 

SEQ ID NO: 3 sets out the sequence of primer 98, which was used in the 
construction of the "Gene Kelly" transposon. 

SEQ ID NO: 4 sets out the sequence of primer 107, which can be used to 
carry out iPCR in a protocol for generating RNA run-offs. 
20 SEQ ID NO:- 5 sets out the sequence of primer 115, which can be used to 

carry out iPCR in a protocol for generating RNA run-offs. 

SEQ ID NO: 6 sets out the sequence of primer 116, which can be used to 
carry out iPCR in a protocol for generating RNA run-offs. 

SEQ ID NO: 7 sets out the. sequence of primer 108, which can be used to 
25 carry out iPCR in a protocol for generating RNA run-offs . 

SEQ ID NO: 8 sets out the sequence of primer 1 17, which can be used to 
carry out iPCR in a protocol for generating RNA run-offs . 

SEQ ID NO: 9 sets out the sequence of primer 118, which can be used to 
carry out iPCR in a protocol for generating RNA run-offs. 
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SBQ ID NO: 10 sets out the sequence of primer 113, which can be used in a 

• protocol for ligation capture recovery of "Gene Kelly" transposon ends. 

SEQ ID NO: 1 1 sets out the sequence of primer 1 14, which can be used in a 
protocol for ligation capture recovery of "Gene Kelly" transposon ends. 

5 

Detailed Description of The Invention 

The invention relates to a new transposon which is suitable for use in 
methods for the identification of essential and conditional essential genes.- The 
. transposon has been named the "Gene Kelly" transposon. 
1 o The transposon of the invention is typically a modified Tn5 transposon, 

although, in principle, any transposon may be modified so as to prepare a transposon 
according to the invention. The transposon of the invention has a combination of 
features which make it a versatile tool for use in a number of protocols for the 
identification of essential and conditional essential genes. We refer to these methods 
15 as transposon mediated differential hybridisation (TMDH) techniques. 

A transposon of the invention comprises an RNA polymerase recognition site 
(sometimes referred to as an RNA polymerase recognition sequence) and a homing 
endonuclease recognition site (sometimes referred to as a homing endonuclease 
recognition sequence). 

20 Typically,the RNA polymerase recognition site is located proximal to an end 

of the transposon, for example adjacent to mosaic ends (if the transposon has them). 
The RNA polymerase recognition site is typically oriented so that it directs 
transcription out of the transposon itself. Thus, DNA sequence flanking the 
integration site of the transposon may be transcribed. 

25 These transcribed sequence originate from DNA sequence fla nking a 

transposon insertion site. Such DNA sequences are therefore susceptible to insertion 

• and are unlikely to represent essential gene sequences. 

Sequences flanking a transposon insertion site may therefore be isolated from 
a library of transposon insertion mutants (generated using a transposon of the 
3 0 invention) and hybridised with an oligonucleotide airay which comprises probes 
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corresponding to open reading frames from an organism to be studied. If the library 
of insertion mutants comprises a transposon insertion in all of the non-essential genes 
of the organism, any probe in the oligonucleotide array to which none of the flanking 
sequences hybridise is likely to be a good candidate for originating from an essential 
"5 gene. Typically, oligonucleotide arrays suitable for use with a transposon comprise 
probes corresponding to all of the open reading frames from the organism in question 
and therefore potentially all of the essential genes of an organism may be identified 
simultaneously. 

Typically, a transposon of the invention comprises two RNA polymerase 
10 recognition sites located proximal to the ends, typically to the opposite ends, of the 
transposon. Preferably, both RNA polymerase recognition sites point out of the 
transposon, i.e. are capable of directing transcription or DNA sequence flanking the 
transposon insertion site. 

Preferred transposons of the invention comprise two diverse (different) RNA 
15 polymerase recognition sites, although the two sites may be the same. The use of 
two RNA polymerase recognition sites allows two separate pools of RNAs to be 
isolated from a library of Gene Kelly insertion mutants. One pool will correspond to 
DNA sequences flanking one side of the transposon insertion site and the other pool 
will represent sequences flanking the other side of the transposon insertion site (these 
20 may be referred to as right T and left-hand pools). The generation of these two 

separate pools may help to minimise the risk of an essential gene being incorrectly 
assigned as a non-essential gene. This is explained in more detail below. 

A transposon of the invention also comprises a homing endonuclease 
recognition site. Homing endonucleases are rare cutters, especially in bacterial DNA. 
25 Incorporation of recognition sites for such endonucleases into a transposon 

effectively permits the introduction of these sites into the genome being studied. 
Transposed DNA may be digested with the appropriate homing endonuclease and the 
resulting ends (if none are present in the bacterial genome) should therefore all 
originate from the Gene Kelly transposon. The fragments resulting from digestion 
30 with a homing endonuclease may then be digested with a restriction endonuclease 
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which cuts in the genome of organism, being studied. The resulting 
transposon:flanking sequence fragments may be rescued using a Hgation-capture 
technique described below, allowing the rapid and selective purification of regions of 
the genomic DNA of the organism being studied which originate from a site of 
5 transposon insertion. Such regions* can be used in hybridisation experiments with 
oligonucleotide arrays as outlined above and as described in more detail below. 

Preferably, if the transposon comprises two RNA polymerase recognition 
sites, it. will also comprise two homing endonuclease recognition sites. Typically, if 
• two homing endonuclease recognition sites are present, they will be diverse 

10 (different) homing endonuclease recognition sites. However,- they may be the same 
endonuclease recognition site. The use of two, typically diverse, homing 
endonuclease recognition sites in combination with two, typically diverse, RNA 
polymerase recognition sites may allow two separate pools of sequences flanking 
either side of the inserted transposons to be isolated separately i.e. a left-hand pool 

1 5 and a right-hand pool of flanking sequences may be separated. The generation of 
these two pools may help to minimise the risk of an essential gene being incorrectly 
assigned as a non-essential gene. This is explained in more detail below. 

" In addition, a transposon of the invention may incorporate a bacterial origin 
of replication. This allows plasmid-rescue of the complete transposon to be carried 

"20 out (plus .the flanking regions of chromosomal DNA around the insertion site). This 
may be achieved by digestion of genomic DNA isolated from a library of transposon 
insertion mutants (with a restriction endonuclease that does not cut in the transposon 
sequence or at least in the bacterial origin of replication in the transposon sequence), 
religation and then transformation into a strain of bacteria in which the origin of 

25 replication will function. A suitable bacterial origin of replication is the R6k origin 
of replication. 

A transposon of the invention thus comprises two critical features: (i) an 
RNA polymerase recognition site; and (ii) a homing endonuclease recognition site. 
Optionally, a bacterial origin of replication may be present. A more preferred 
30 version of the Gene Kelly transposon comprises two diverse RNA recognition sites 
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and two diverse homing endonuclease recognition sites. The full sequence of a 
preferred Gene Kelly transposon is set out in Figure 2 and SEQ ID NO: 1 . 

Transposons, sometimes called transposable elements, are mobile 
polynucleotides. The term transposon is well known to those skilled in the art and 
5 includes classes of transposons that can be distinguished on the basis of sequence 
organisation, for example short inverted repeats at each end; directly repeated long 
terminal repeats (LTRs) at the ends; and polyA at 3" ends of RNA transcripts with 5 ! 
ends often truncated. Some types of virus also integrate into the host genome, for 
example retroviruses, and may therefore be used to generate libraries of insertion 
10 mutants. However, transposons are typically preferred to viruses because issues of 
safety related to pathogenicity may be avoided. 

A transposon of the invention comprises an RNA polymerase recognition site 
(typically two) and a homing endonuclease recognition site (typically two). Any 
suitable transposon may be modified to produce a transposon of the invention. 
1 5 Suitable transposons for use in bacteria which can be modified to generate a Gene 
Kelly transposon include Tn3, y5, 7>il0, TnS, 7>*phoA, 7>?903, 7«917, Bacteriophage 
Mu and related viruses. Any of the above mentioned transposons may be modified 
to generate a transposon of the invention. Different parts of different transposons 
may be mixed and matched in a transposon of the invention. A particular preferred 
20 transposon for use in the invention is a modified TnS transposon. 

Suitable transposons for use in fungi which can be modified to generate a 
transposon of the invention include the Ty 1 element of Saccharomyces cerevisiae, 
the filamentous fungi elements (the filamentous fungi include agriculturally 
important plant pathogens such as Erysiphe graminis, Magnapbrthe grisea) such as 
25 * FotVPogoAske and Tcl/Mariner-hke elements (see Kempen and Kuck, 1998, 
Bioessays 20, 652-659 for a review of such elements). ' 

Suitable transposons for use in plants which can be modified to generate a 
transposon of the invention include Ac/Ds, Tam3 and other Tarn elements, cinA and 
spm. 



-13- 

Suitable transposons for use in animals -which can be modified to generate a 
transposon of the invention include P and hobo which may be used in Drosophila 
and Tel which can be used in Gaenorhabditis elegans. 

A preferred transposon of the invention is one which carries an antibiotic 
5 resistance gene (which may be useful in identifying mutants which carry a 
' transposon) conferring resistance to, for example, kanamycin, streptomycin and 
bleomycin. 

A transposon of the invention may comprise one, two or more recognition 
sites for an RNA polymerase. Preferred recognition sites are those for which the 
10 ' corresponding RNA polymerase is highly selective for initiation. Other preferred 
. recognition sites are those for which the corresponding RNA polymerase does not 
initiate transcription from sequences of the organism being studied. Preferred 
examples of RNA recognition sites are those recognised by bacteriophage RNA . 
polymerases, for example the recognition site for 17 RNA polymerase, T3 RNA 
1 5 polymerase or SP6 RNA polymerase, in particular the T7 RNA polymerase 

" recognition site. A preferred Gene Kelly .transposon will thus carry two of a T7 RNA 
polymerase recognition site, an SP6 RNA polymerase recognition site or a T3 RNA 
polymerase recognition site, for example a 17 RNA polymerase recognition site and 
an SP6 RNA polymerase recognition site. The recognition sites for these specific 
20 RNA polymeras es- are well known to those skilled in the art. 

The RNA polymerase recognition site may appear anywhere within a 
transposon for use in the invention. However, typically the RNA polymerase 
recognition site will be located proximal to one end of the transposon, i.e. proximal 
to one IS/ME. Typically, the 3 1 end of the RNA polymerase recognition site will be 
25 situated from one to 30, for example' from five to twenty base pairs away from the 5' 
end of one of the IS/ME. 

Preferred transposons of the invention comprise two RNA polymerase 
recognition sites, which generally will be different RNA polymerase recognition 
sites. For example, a transposon may comprise a T7 RNA polymerase recognition 
30 site and an SP6 RNA polymerase recognition site. * ( 
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A transposon of the invention may also comprise more than two RNA 
polymerase recognition sites, for example three or four RNA recognition sites. More 
than two recognition sites may be useful in a situation where the genome of an 
organism being studied possesses recognition sites for one of the RNA polymerase 
5 recognition sites present in the transposon. Thus, different RNA polymerase sites in 
one transposon may be suitable for use in different genomes. The specific RNA 
polymerase recognition sites used may vary with the particular organism being 
studied. A single transposon may therefore be suitable for use in a number of 
different organisms. 

10 A transposon of the invention comprises a homing endonuclease recognition 

site. Preferably, two such sites axe present which may be the same or diverse. 
Homing endonucleases are also known as intron or intern encoded endonuclease. 
They are encoded by genes with mobile, self-splicing introns or inteins (protein 
introns). Any homing nuclease recognition site may be used, for example I-Scel, PI- 
15 Pspl or I-Ppol, or preferably a combination of any two thereof. 

A homing endonuclease recognition site may be located anywhere in the 
transposon, but typically it is situated 5 ! to an RNA polymerase recognition site, i.e. 
further into the transposon than the RNA polymerase recognition site, Preferred 
* transposons of the invention may comprise two homing endonuclease recognition 
20 sites which generally will be different homing endonuclease recognition sites, for 
example a l-Scel and a PI-P^pI recognition site. This is usually the case where the 
transposon comprises two RNA polymerase recognition sites. 

•A transposon of the invention may, however, comprise more than two 
homing endonuclease recognition sites, for example three or four homing 
25 endonuclease recognition sites. More than two recognition sites may be useful in a 
situation where the genome of an organism being studied possesses recognition sites 
for one of the homing endonuclease recognition sites present in the transposon. 
Thus, different homing endonuclease recognition sites in one transposon may be 
suitable for use in different genomes. The specific homing endonuclease recognition 
30 sites used in the transposon may vary with the particular organism being .studied. A 
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single transposon may therefore be suitable for use in a number of different 
organisms. 

Typically, a transposon of the invention will comprise the same number of 
RNA polymerase recognition sites as homing endonuclease recognition sites, ideally 
5 two of each. Each homing endonuclease recognition site will typically be located 5* 
to an RNA polymerase recognition site, for example such that the 3' end of the 
homing endonuclease recognition site is from one to 30, for example, from five to 
twenty, base pairs away from the 5' end of one of an RNA polymerase recognition 
site. 

-10"" - A transposon of the invention may also comprise a bacterial origin of 

replication. A preferred example of a suitable bacterial origin of replication is the 
R6k origin of replication. The R6k origin of replication is capable of functioning in a 
pir* strain of bacteria. 

A transposon of the invention may be used in methods for the identification 

15 of essential or conditional essential genes, i. e. use of a transposon of the invention in . 
a method for the identification of an essential gene or a conditional essential gene is 
provided according to the invention. The methods described below for the 
identification of essential or conditional essential genes using a transposon of the 
invention can conveniently be referred to as transposon mediated differential 

• 20 hybridisation (TMDH). TMDH techniques are described in detail in 

WO- A-0 1/0765 1 (PCT/GB00/02879) and the transposon of the invention may be 
used in any of the methods set out therein. 

In methods for the identification of essential gene, typically the first step is 
the provision of a library of transposon insertion mutants. Libraries of insertion 

25 mutants using a transposon of the invention may be generated according to any 
method known to those skilled in the art. For example, libraries of bacterial 
transposon insertion mutants can be constructed using either plasmid or 
bacteriophage vectors con tainin g the transposon and a selectable marker. 
Bacteriophage \ eg. XTnphoA can be used to infect a suitable recipient bacterial 

3 0 strain, for example E. coli XAC. This K coli strain has a suppressor mutation which 
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• prevents the bacteriophage from replicating and subsequently lysing and also 
contains an antibiotic resistance gene to allow selection of colonies containing 
transposed chromosomal DNA. The vector contains mutation(s) preventing 
integration of the X chromosome into the bacterial host chromosome and thus the 

-5 growth of false -positive colonies withouta mutate~d K coli geheis prevented; 

Cultures of the recipient strain are grown in enriched medium (eg. Luria Broth) and 
cells in mid log phase of growth are infected with the X transposon vector for 1 hour 
at 37°C. Aliquots of the infected cells are plated out on L-agar supplemented with 
the appropriate selective antibiotic and grown overnight at 37°C. These colonies 
10 constitute a transposon library and can be further analysed by the TMDH procedure 

• described in this application. 

Alternatively, transposome complexes comprising the transposon in a 
complex with a transposase may be generated and electroporated into a suitable 
electrocompetenthost. Suitable techniques for preparing transposomes and for 
15 electroporating transposomes into host cells are well known to those skilled in the 
art. 1 

Growth of such libraries results in the generation of potentially thousands of 
insertion mutants all of which mutants carry insertion that are, of necessity, in genes 
that (when mutated) do not result in the death of the cell ie. are non-essential genes. 
20 Each mutant in a suitable transposon insertion library may carry one 

transposon insertion. However, a mutant may carry more than one transposon 
insertion, for example two, three, four, five, ten or twenty transposon insertions A 
transposon insertion mutant library suitable for use in the invention will comprise at 
least one transposon insertion mutant for at least 60%, at least 70%, typically at least 
25 80%, preferably at least 90%, more preferably at least 95%, even more preferably at 
■ least 99%, or most preferably substantially all of the non-essential genes in the 
organism being studied. Preferably the library will be a saturating library, i.e. the 
library comprises a transposon insertion mutant for substantially all genes of the 
organism that when mutated give rise to viable organisms. 
30 A transposon insertion could be in an open reading frame of a gene or in a 

regulatory sequence of gene. 
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Any non-essential gene in the transposon insertion library may be represented 
by more than one insertion mutant, for example two, three, four, five or up to ten 
insertion mutants, each carrying transposon insertions at the same or different sites in 
the non-essential gene or carrying insertions at the same site in different orientations. 
5 Preferred libraries will have, on average, more than one different transposon insertion 
mutant for each non-essential gene represented in the library, for example at least two 
on average, at least four on average, at least 5 on average or at least 10 on average 
different transposon insertion mutants for each non-essential gene represented in the 
library. 

1.0 Some regions of a particular genome may be inaccessible to insertion by a 

particular transposon, for example because of a particular secondary or tertiary 
structure which is inaccessible to a particular transposon. Thus it may be . _ 

advantageous to combine two transposon libraries, thereby increasing the probability 
of obtaining transposon insertions in a greater number of genes. For example, in the 

15 case of bacterial libraries, a library generated with Gene Kelly transposons based on a 
Tn5 transposon and a library generated with a Gene Kelly transposons based on a 
• 7>2l0 transposon could, for example, be combined. 

Generally, flanking sequence will be isolated from at least 60%, for example 
at least 70%, at least 80%, preferably at least 90%, more preferably at least 95% and . 

20 most preferably at least 99% of the transposon insertion mutants in a particular 
library of mutants. 

In the method of the invention chromosomal DNA is prepared from the 
library of transposon insertion mutants. Techniques for the isolation of chromosomal 
DNA, alternatively referred to as genomic DNA, are well known to those skilled in 

25 the art. The transposons of the invention allow for a number of different techniques 
to be used for the generation of RNA target sequences from the isolated genomic 
DNA. 

In one version of TMDH, the chromosomal DNA thus prepared is then 
' digested with a restriction endonuclease. The restriction endonuclease is one which 
30 is capable of cutting at a recognition site which is located in the transposon at a 
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position 5 1 to the KNA polymerase recognition site (which is located in the 
transposon) and 3' to the KNA polymerase recognition site in the chromosomal DNA 
flanking the transposon insertion site. 

In a modification of this protocol, two restriction endonucleases may be used: 
5 a first restriction enzymewhich cuts" "S 1 to the "Rl^ polymerase recd^iitioh site 
(which is located in the transposon) in the transposon itself; and a second restriction 
enzyme which cuts 3' to the RNA polymerase recognition site in the chromosomal 
DNA flanking the transposon insertion site. If two restriction enzymes are used in 
this way, they may be used simultaneously or one after the other in either order. 
10 The exact restriction enzyme(s) to be used will depend on the sequence of the" 

transposon. However, typically an restriction endohuclease is used which has 
recognition sites that appear frequently within the genome of the organism being 
studied. Thus, a series of DNA fragments is generated, some of which comprise an 
RNA polymerase recognition site fused to a portion of flanking sequence, i.e. non- 
15 essential gene sequence. 

Generally, suitable restriction endonucleases will have six base pair, five base 
pair or preferably four base pair recognition sequences. Suitable examples of four 
base pair cutters are set out in Table 1 below: 

20 . Table 1. Examples of 4bp recognition type II restriction endonucleases suitable for 
use in TMDH 



Enzyme 


Recognition Site 


Enzyme 


Recognition Site 
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C'CGC 


Arise! 


TTAA 




GGC t G 




AAT,T 


AM 


AtfCT 


Mspl 


C'CGG 




TC t GA 




GGC.C 


BfaL 


C'TAG 


NlaKL 


'CATG 




GAT,C 




GIAC, 


Bstul 


CG'CG 


Rsal 


GT'AC 




GC.GC 




CA,TG 
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Dpnl 


l GATC 


Sou3sl 


l GATC 




CTAG, 




CIAG, 


HaeJU. 


GG'CC 


Taql 


VCGA 




CC t GG 




AGC T T 


Hinpl 


&CGC 


Tsp509 


l AATT 




CGC,G 




TTAA t 


HJiaL 


GC&C 








C.GCG 
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The resulting fragments comprise an RNA polymerase site adjacent to non- 
essential gene sequence. The fragments may optionally be size selected. If size 
selection is carried out, fragments with a size of from about 100 bp to about 2000bp 
or preferably of from about 200 bp to about 600 bp may be isolated, for example 

10 from a gel, and purified. The smaller the fragments isolated, the smaller the chance 
of the RNA target sequences including sequences from genes which lie next to genes 
which have been interrupted by transposons. If such adjacent sequences were from 
essential genes, there is the possibility that essential gene sequences could be 
identified as non-essential gene sequences. Thus, size fractionation may reduce the 

15 amount of false non-essential gene sequences. 

RNA target sequences are generated from the DNA (host organism) 
sequences that flank the transposons, i.e. those regions corresponding to non- 
essential gene sequences. That is, the transposon: flanking sequence fragments are 
then used to generate the RNA sequences. Thus, following digestion, the 

20 transposon:flanking sequence fragments (the "target") may be transcribed. 

Optionally, the transposonrflanking sequence fragments may be amplified prior to 
transcription, for example by PCR. Preferably this is carried out by iPCR (inverse 
PCR). 

Transcription is carried out by in vitro transcription from the RNA 
25 polymerase recognition sequence. Techniques for carrying out in vitro transcription 
are well known to those skilled in the art and any suitable technique may be used. In 
essence, an RNA polymerase and ribonucleotides are used. 



f 
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The RNA target sequences so-generated may then be hybridised with 
oligonucleotide arrays. Alternatively, the RNA target sequences may be reverse 
transcribed to produce cDNA target sequences. Reverse transcription may be primed 
using an oligonucleotide having a sequence based on the transposdn sequence 
5 immediately -3' to the site of trans eription initiation. Techniques for-carrying- out- 
reverse transcription are well known to those skilled in the art and any suitable 
technique may be used. In essence, a reverse transcriptase and deoxyribonucleotides 



1 0 carried out in the presence of one or more labelled ribonucleotides or one or more 

labelled deoxyribonucleotides respectively, so that the resulting KNA or cDNA target 
sequences are labelled. 

Suitable labels include radioactive labels, for example 32 P, 33 P or 35 S, or non- 
radioactive labels, for example an enzyme, a fluorescent label or biotin. 

1 5 . Fluorescent labels are preferred, for example a water-soluble fluorescent dye 

such as CyS™ or CyS™ or a fluorescein-tagged compound such as FluorX™ (the 
NHS ester of carboxyfluorescein with an extended linker arm), fluorescein 
isothiocyanate (FITC) or 5-([4,6-DicMorotriazin-2-yl]amino)fluorescein (DTAF). 
Generally, it will only be necessary to have one of the four ribonucleotides or 

20 deoxyribonucleotides labelled. 

The techniques described above allow the isolation of sequences flanking the 
transposons in a library of transposon insertion mutants. Thus, a pool of fl ankin g . 
sequences is generated collectively referred to as the RNA (or cDNA) target 
sequences. Although fragments in the pool are generated from only one side of the 

25 transposons, a transposon is capable of inserting at any particular locus (that can be 
disrupted) in either orientation. Thus, particularly in a saturating transposon 
insertion library, many loci will be represented by mutants carrying insertions in both 
orientations. Therefore, the RNA (or cDNA) target sequences generated according to 
the TMDH method of the invention will, for many loci, comprise fla nkin g sequence ' 

30 in both orientations. 



are used. 



Preferably, the transcription reaction or the reverse transcription reaction is 
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In a modification of this technique, the fragments comprising the RNA 
polymerase site fused to non-essential gene sequence may be amplified. 

Amplification may be carried out by ligating linkers, preferably vectorette 
units, to the fragments. If linkers are ligated to the fragments, the resulting fragments 
5 - may be re-purified for example through a gel or by using spun-column 

chromatography. PGR may then carried out using the fragments as templates with a 
primer pair comprising an oligonucleotide specific for a transposon sequence and a . 
second oligonucleotide specific for a linker (eg. a vectorette) sequence. The use of 
transposon- and vectorette-specific PCR primers results in the specific amplification 
10 of sequences that are adjacent to the sites of transposon insertion. 

Alternatively^ the fragments may be amplified by cycle primer extension. 
The use of a suitable labelled oligonucleotide primer can allow the amplification of 
sequences adjacent to the sites of transposon insertion. Those labelled amplified 
sequences can be used directly in hybridisation experiments. 
1 5 Alternatively, the fragments may be amplified by inverse PCR (iPCR). Thus, 

the fragments may be self-ligated and subsequently amplified using transposon 
specific primers. 

Suitable techniques for carrying out self-ligation are well known to those 
skilled in the art. Any suitable ligase may be used, for example T4 DNA ligase. 

20 Ligation reactions may be carried out for from 6 to 24 hours, for example from 12 to 
16 hours at a temperature of from 10°C to 20°C, for example at about 16°C. 

Self-ligated molecules are then amplified using iPCR Techniques for 
carrying out iPCR are well known to those skilled in the art and may be carried out- 
according to any suitable technique. • Typically, iPCR is carried out using two 

25 olignucleotides which bind divergently at a location 5' to the RNA polymerase 

recognition site. Preferably the oligonucleotides bind divergently to a location which 
is 3' to the restriction endonuclease recognition site in the transposon. That is, the 
two olignucleotide recognition sites are preferably located on .the transposon between 
the .restriction endonuclease recognition site and the RNA polymerase recognition 

30- site. 
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When losing iPCR techniques, there is the possibility that, a "stuffer" 
fragment may ligate into the self-ligation reaction, which will be amplified along 
with the transpos on-disrupted sequence. If this material were to be used in 
subsequent generation of the RNA target sequences, the stuffer sequence could create 
.5. ..nonrspecific background.signal.as it would.also.he hybridized to ihe_high. density., 
array. La order to remove this stuffer fragment, the sequences amplified in iPCR can 
be redigested with whichever enzyme was used to isolate the transposon-flanking 
sequence fragments in the first place. This results in the release of the stuffer 
fragments which can be removed from the transposomflanking sequence fragments. 

10 Removal of the stuffer fragments can be facilitated if a biotinylated primer is used in 
iPCR The biotinylated transposon:flanking sequence fragments can then be 
removed from the stuffer fragments using a magnetic-bead-streptavidin conjugate. 

Additional methods for amplifying transposon:flanking sequence fragments 
include, for example, splinkerette-PCR, targetted gene walking PCR, restriction site 

1 5 PCR, capture PCR, panhandle PCR and boomerang DNA amplification (for a review 
of these techniques seeHui et al, Cell Mol. Life Sci. 54 (1998) 1403-1411). 

It is possible- that the particular restricted endonuclease used will not cut 
within the gene in which the transposon is inserted, or cuts at a large distance, for 
example more than 2kb, away from the insertion site. Therefore, if size selection is 

20 carried out, sequences from these genes may be lost. Thus, the generation of 

fragments may be carried out several times, each time using a different restriction 
endonuclease and the resulting fragments may subsequently be pooled. The greater 
the number of enzymes used to make fragments, the greater the likelihood of 
sequences from non-essential genes being represented in the final pool of fragments. 

25 In a further modification of TMDH method, the chromosomal DNA isolated 

from a transposon insertion library may be divided into a number of aliquots/ Those 
aliquots may then each be separately digested with a different restriction 
endonuclease which is capable of cutting at a recognition site which is located in the 
transposon at a position 5' to the RNA polymerase recognition site and in the 

30 chromosomal DNA flanking the transposon 3 1 to the RNA polymerase recognition . 
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site (which is in the transposon). The chromosomal DNA may be separated into, for . 
example two, three, four, five or ten aliquots which are each separately digested with, 
a different restriction endonuclease. Preferred restriction enzymes are as set out in 
Table 1 above. 

5 Thus, for example, two or three aliquots of the chromosomal DNA may be 

separately digested with different suitable restriction endonucleases, for example two 
or three of HaeHl, Hha\ Bpych4IV and RsdL 

If the TMDH protocol is used in this modified format, the different aliquots 
may be repooled after digestion and treated together in the subsequent steps of 
1 0 TMDH. Alternatively, the digested aliquots may be treated separately in the 

subsequent steps. If this TMDH format is adopted, a number of pools of RNA target 
sequences result. Each pool of RNA target sequences may be labelled with a 
different, for example fluorescent, label. 

In an alternative protocol for the preparation of RNA/cDNA target sequences, 
15 the genomic DNA isolated from a library of transposon mutants is digested with.a 
homing endonuclease. Homing endonuclease recognition sites are rare and therefore 
• any "ends" generated by digestion with a homing endonuclease should originate from 
the transposon. Fragments resulting from digestion with a homing endonuclease are 
then further digested with a restriction endonuclease which does not- cut in the 
20 transposon, for example a restriction enzyme as described above, but which does cut 
within the genomic sequence flanking the transposon ends (i.e. fl ankin g the 
transposon insertion site). The resulting transpbsoniflanking sequence fragments 
may then be rescued by annealing a biotinylated linker to them and then isolating the 
■ biotinylated transposomflanking sequence fragments with streptavidin-coated 
25 particles, for example streptavidin-coated magnetic beads. The linker anneals to the 
homing endonuclease recognition site end of the fragments. 

Alternatively the ends bearing the homing endonuclease recognition site may 
be rescued by labelling with digoxygenin and isolating the labelled fragments with an 
antibody raised against digoxygenin. The step of digestion with a restriction enzyme 
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may be carried out after the transposontfl anking sequence fragments have been 
isolated. 

The use of homing endonucleases may allow the step of isolating genomic 
DNA from a library of transposon mutants to be eliminated. Thus, the library of 
"5 "mutants" may "be digested " directly with aliomirig eM'onuclease. Typically; an "extract 
of the library may be generated. For example, if the library is a bacterial library, the 
baterial cells may be lysed before digestion with a homing endonuclease is carried 
out. Once digestion with a homing endonuclease has been carried out, transposon 
ends (transposomflanking sequence fragments) may be recovered as described above. 
.10 The transposon:flanking sequence fr&gments are then used to generate the 

RNA target sequences by carrying out in vitro transcription from the RNA 
polymerase recognition site. In vitro transcription can be carried out as described 
above. If, after cutting with the homing endonuclease, a further restriction enzyme is 
not used to cut within the flanking sequences, in vitro transcription may be carried 
1 5 . out in the presence of a dideoxyribonucleotide. That allows transcription to be 
terminated, reducing the risk of the transcribed sequence comprising portions of 
genes adjacent to the gene into which insertion has taken place. 

The sequences which comprise the RNA target sequences (whichever of the 
- methods described above is used to generate them) may be used for hybridisation 
20 with olignucleotide arrays. Oligonucleotide arrays used in the TMDH protocol of the 
invention are preferably high information content arrays. 

Oligonucleotide arrays suitable for use in the invention may comprise 
sequences from one or more loci of a genome. Preferably suitable oligonucleotide 
arrays will represent at least 80% of all open reading frames (ORFs), more preferably 
25 at least 90% of all ORFs, for example 95% of all ORFs, even more preferably~99% 
of all ORFs or substantially all ORFs of the genome represented on the 
oligonucleotide array. 

By high information content array is meant an array in which there are a high 
number of probes covering the locus, loci or genome represented by the array. For 
30 example, in a high information content array there may be a probe, for example, for 
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every 30 to 500 base pairs of the locus, loci or genome represented by the array. 
Preferably there will be a probe, for every 60 to 250 base pairs of locus, loci or 
genome represented in the array, for example about every 100 base pairs. Probes 
may overlap, for example by 1, 2, 3, 4, 5, up to 10, up to 20, up to 30, up to 46 or up 
5 to 50 bases. 

The olignucleotide probes on the array are, for example, from about 8 or 9 to 
about 150 nucleotides in length, preferably from about 30 or 50 to about 100 
nucleotides in length or more preferably about 60 nucleotides in length. 

The oligonucleotide probes used in the array will typically be designed on the 
10 basis of the wild type sequence of the organism being studied. The oligonucleotide 
probes may be designed so that each probe has minimal or substantially no cross- 
hybridisation with other sequences in the genome from which the probes originate. 
The BLAST program can be used to design suitable probes (Altschul ei al, J. Mol. 
Biol. 215, 403-410). 

1 5 Methods for making oligonucleotide arrays are well known to those skilled in 

the art. 

Probes which show no hybridisation or substantially no hybridisation (there 
may be a low level of background non-specific hybridisation) with the RNA target 
represent sequences that are unlikely to have been disrupted by a transposon insertion 
20 event and consequently are strong candidates for sequences corresponding to 
essential.genes. 

However, it is theoretically possible for oligonucleotide probes within the 5' 
or 3 '-termini of essential genes to show a hybridisation signal with the TMDH 
protocol. For example, if a transposon insertion occurs in a non-essential gene 

25 adjacent to an essential gene, RNA target sequences may be generated from this 
transposon corresponding to both non-essential and essential gene sequences as a 
result of restriction sites lying within the essential gene. The resulting labelled target 
will not only comprise DNA corresponding to the non-essential gene (that has been 
disrupted), but will also extend into the adjacent essential gene up to the restriction 

30 site. The result of hybridising this labelled target to the oligonucleotide array will be 
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appear as "bleed through" of signal to probes on either the 5' or 3' end of the 
essential gene, up to the restriction site used for the TMDH protocol. 

To address this potential source of mis-assi gnm ent of essential genes, the 
restriction endonuclease digestion TMDH protocol described above may be carried 
- 5 -with more- than one aliquot of the isolated genomic DNA,-for- example two or-three, 
whereby each aliquot is digested with a different restriction endonuclease (which 
have different recognition motifs). The more aliquots digested, with more restriction 
sites that are used to generate target sequences, the more statistically unlikely it is 
. that all of them will result in labelled RNA target sequences that "bleed through" into 

10 essential genes. The pools of RNA target sequences derived from the different 
digestions can be hybridised to the same or, preferably, different oligonucleotide, 
arrays if they were generated using different labels, or alternatively may be 
hybridised to copies of the same or, preferably, a different array. The analysis of the 
resulting multiple array hybridisation patterns will remove any ambiguity on the site 

15 of transposon insertion. 

A different modification to the TMDH protocol, applicable to both the 
standard restriction endonuclease and the homing endonuclease approaches, to 
minimize the mis-assignment of essential genes as non-essential gene is to isolate 
two pools of RNA target sequences which each originate from different sides of the 

20 transposons. This approach is illustrated in Figures 5 and 6. Figures 5 and 6 show 
Gene Kelly transposons which comprise two RNA polymerase recognition sequences 
(although in Figure 6 the two ends are shown superimposed).^ 

In figure 5 one pool of RNA target sequences (from one side of the 
transposons) is generated by carrying out in vitro transcription using T7 polymerase 

25 and a second pool of RNA target sequences (from the other side of the transposon) is 
generated by carrying out in vitro transcription using T3 or SP6 polymerase. 

Similarly, in figure 6 one pool of RNA target sequences (from one side of the 
transposons) is generated by carrying out in vitro transcription using T7 polymerase 
and a second pool of RNA target sequences (from the other side of the transposon) is 

30 generated by carrying out in vitro transcription using T3 or SP6 polymerase. The 
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ends lying 5' to the RNA polymerase recognition site (which are rescued using 
streptavidin coated particles) are generated via digestion with homing endonucleases. 
The homing endonuclease recognition site associated with each KNA polymerase 
recognition site may be the same or different homing endonuclease recognition sites. 
5 The two pools of RNA target sequences can be hybridised to the same 

oligonucleotide array if they were generated using different labels, or alternatively 
- each may be hybridised to a separate copy of the same array. Probes in the 
oligonucleotide array which shows no hybridisation to either pool are likely to 
correspond to essential genes. 

10 Where an essential gene sequence is isolated in one of the RNA/cDNA pools 

because it lies close to a non-essentiai gene sequence flanking a transposon insertion 
site, hybridisation to a probe on the oligonucleotide array will be observed even 
" though that probe corresponds to an essential gene. However, that probe will show 
no hybridisation with the other pool of RNA target sequences which comprise the 

1 5 flanking sequence from the other side of the transposon. Thus, in this type of TMDH; 
a probe in the oligonucleotide array to which at least one of the pools of RNA target 
sequences does not hybridise is likely to correspond to a gene that has not been 
disrupted by a transposon and may therefore be assigned as an essential gene. 

In the methods described above (and also in those described below for the 

20 identification of conditional essential genes), the pools of RNA sequences may be 
directly hybridised to arrays as set out above. Alternatively, before hybridisation is 
carried out, the pools of RNA sequences may be subjected to reverse transcription to 
generate pools of cDNAs. The sequence of the particular transposon of the invention 
used to generate the library of mutants may be used to design oligonucleotide 

25 primers suitable for priming reverse transcription or, alternatively, random primers 
. may be used. A label may be included in the reverse transcription reaction so that 
labelled cDNA pools are generated. The cDNA pools may be then be hybridised to 
arrays as is described above in relation to RNA pools. 

The TMDH methods described above may also be used for the identification 

30 of conditional essential genes. Conditional essential genes are those which are not 
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absolutely essential for bacterial survival, but are essential for survival in particular 
environments e.g. for growth/proliferation, in a host (in the case of a pathogenic 
bacterium) or for survival at elevated temperatures. Such environments are described 

* here as conditional restraints. 

5 - In orderto isolate conditional-essential ^ genes,- a library of^transposon -mutants 

is generated under control conditions (eg. growth at 37°C in complete media); The ■ 
library of mutants is then subjected to some conditional restraint. For example, the 
library of mutants can be inoculated in a suitable host, if it is a pathogen. 
Alternatively, the library of mutants can be grown at an elevated temperature. After 
10 the library of mutants has been subjected to the conditional restraint it can be 
recovered. 

The library of mutants may be recovered by, for example, recovering tissue 
such as the livfcr and/or spleen from a host and plating out an extract derived from 
that tissue on growth medium. All- surviving colonies on the plate represent mutants 

15 where a transposon insertion has occurred in a non-essential sequence. The surviving 
colonies can be pooled to give an output library of mutants which can then be 
subjected to one of the TMDH methods set out above. 

Alternatively, non-essential DNA sequences may be recovered directly from 
tissue (such as the liver and/or spleen) of an infected host without an intervening step 

20 of plating out an extract of the isolated tissue. In this regard, the presence of one, or 
preferably two, homing endonuclease recognition sites in the transposon is crucial. 
Thus, the tissue may be subjected to digestion with a homing endonuclease for which 
there is a corresponding recognition site in the relevant transposon. Typically, the 
tissue would be homogenised before digestion. Transposon ends (i.e. 

25 transposon:flanking sequence fragments) may then be recovered from the tissue. 
This may be achieved by annealing biotinylated linkers to the transposon ends and 
recovering the resulting biotinylated transposon ends with, for example, 
strepatavidin-coated beads. The isolated transposon:flanking sequence fragments 

* may then be digested with a restriction endonuclease which cuts in the sequence 
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flanking the transposon insertion site. Alternatively, the fragments may be digested 
with a restriction enddnuclease prior to recovery from the tissue. 

The library of mutants that have been exposed to the conditional restraint will 
lack mutants which carry transposons in those genes essential for growth under the 
5 conditional environment, for example growth/proliferation in a host organism. 

The control and conditional restraint libraries can then be subjected to the 
• TMDH protocols described above using the a transposon of the invention. Clearly, if 
transposon:flanking sequence fragments have been isolated directly, fox example 
from a host tissue using capture of, those fragments enter a TMDH protocol at the . 
.10... stage of preparing KNA by transcription from the KNA polymerase site(s). 

It is not necessary to use the same TMDH protocol for each library. The two 
resulting KNA target sequence libraries may then be hybridised separately to high 
density oligonucleotide arrays. Alternatively they can be hybridised to the same 
array, if the control and conditional restraint libraries are differentially labelled for 
15 example. 

Comparison of the results given with the control and the conditional restraint 
libraries will allow the identification of genes which permit survival in the 
conditional restraint. Genes identified as essential for survival in the conditional 
restraint library, but not identified as essential for survival under control conditions 

20 should represent genes that are essential for survival under the conditional restraint 
conditions . In particular, probes which show hybridisation with KNA target 
sequences from the input library but which show no hybridisation or substantially no 
hybridisation (there may be a low level of background non-specific hybridisation) 
with KNA target sequences from the output library are strong candidates for 

25 sequences corresponding to conditional essential genes. The same "bleed through" 
considerations apply as set out above and the modified TMDH protocols for 
overcoming such "bleed through" may need to be used. 

In the case of the analysis of conditional mutations in a pathogen, a library of 
Salmonella typhimurium transposon mutants, for example, can be used to infect a 

30 mouse. Following infection, bacteria target to livers and spleens and the course of 
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infection can be conveniently followed by performing viable bacterial counts on 
those organs. The bacteria recovered from the livers and spleens can be grown on 
suitable plates. In the case of the conditional restraint at elevated temperature, a 
transposon-tagged library can be grown at 42°C. 
5 0ther conditional restraints-include growthof antibiotic resistantbatteriain 

the presence of antibiotics. This may reveal genes which are essential for antibiotic 
resistance. Such genes would be targets for drugs with the ability to lower bacterial 
resistance to particular antibiotics. Organisms could be grown in the presence of 
carcinogens, UV or other agents that cause oxidative stress and thus genes that confer 
' . 10. resistance to growth under those conditions may be identified. 

Potential essential gene sequences and conditional essential gene sequences 
identified by a TMDH strategy using a transposon of the invention may be verified 
using a method based on allelic exchange. This technique is particularly suitable for 
analysis of bacterial genes. PCR primers can be used to generate left- and right-arm 
15 sequences corresponding to the target gene sequence and ligated with a 

kanamycin-resistance encoding gene cassette. The resulting cassette can be 
introduced into a suicide vector, for example a plasmid-based vector, which is unable 
to replicate in a host bacterium. 

In the case of a candidate essential gene, the resulting construct can be 
20 introduced into the bacterial strain from which the candidate gene originates. If the 
target gene is essential, it should be impossible to isolate allelic-exchange mutants 
that have a disrupted version of the target gene. Ia the case of a candidate 
" conditional essential gene, the essential gene can be introduced into the bacterial 
strain from which the candidate gene originates. Allelic-exchange mutants can be 
25 isolated and subjected to growth under the conditional restraint. If the candidate 

gene is a conditional essential gene, it should not be possible for the allelic-exchange 
mutants to survive under the conditional restraint. 

Similar experiments may be performed for other organis m s. 
The use of bioinfonnatics may allow the rapid isolation of further essential 
30 and conditional essential genes. A gene identified by TMDH using a transposon of 
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the invention may be used to search databases containing sequence information from 
other species in order to identify orthologous genes from those species. Genes so 
identified can be tested for being essential or conditionally essential using the genetic 
techniques described above. For exampie, an E coli gene is identified as essential 
5 using a method as described above. This may allow the identification of a putative 
orthologue from Salmonella. That Salmonella gene may be tested by allelic 
exchange and the construction of conditional mutants in Salmonella as described 
above. Further orthologues may be identified in more distantly related organisms, 
for example from Plasmodium species. 
10 Suitable bioinformatics programs are well known to those skilled in the art. 

For example, the Basic Local Alignment Search Tool (BLAST) program (Altschul et 
al, 1990, J. Mol. Biol. 215, 403-410. and Altschul et al, 1997, Nucl. Acids Res. 25, 
3389-3402.) may be used. Suitable databases for searching are for example, EMBL, 
■ GENBANK, TIGR, EBI, SWISS-PROT and trEMBL. 
1 5 Organisms that may be used in the invention are those for which it is possible 

to carry out transposon mutagenesis and thus, those that can give rise to a library of . 
transposon mutants. Clearly, if the genome is bigger, more mutants will have to be 
produced in order to give a better chance of achieving saturation mutagenesis. 
Suitable organisms include prokaryotic and eukaryotic organisms. Suitable 
20 prokaryotes include bacteria. Preferred bacteria are those which are animal or human 
or plant pathogens. 

The bacteria used may be Gram-negative or Gram-positive. The bacteria may 
be for example, from the genera Escherichia, Salmonella, Vibrio, Haemophilus, 
Neisseria, Yersinia, Bordetella, Brucella, Shigella, Klebsiella, Enterobacter, 

25 Serracia, Proteus, Vibrio, Aeromonas, Pseudomonas, Acinetobacter, Moraxella, 
Flavobacterium, Actinobacillus, Staphylococcus, Streptococcus, Mycobacterium, 
Listeria, Clostridium, Pasteurella, Helicobacter, Campylobacter, Lawsonia, 
Mycoplasma, Bacillus, Agrobacterium, Rhizobium, Erwinia oxXanthomorvas. 
Examples of some of the above mentioned genera are Escherichia coli*- a cause of 

30 diarrhoea in humans; Salmonella typhimurium - the cause of salmonellosis in several 
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animal species; Salmonella typhi - the cause of human typhoid fever, Salmonella 
enteritidis - a cause of food poisoning in humans; Salmonella choleraesuis - a cause' 
of salmonellosi3 in pigs; Salmonella dublin - a cause of both a systemic and 
diarrhoea! disease in cattie, especially of new-born caives; Haemophilus influenzae - 
5 a-cause-of meningitis; JVeis^r/a-g-o^^ 

enterocolitica - the cause of a spectrum of diseases in humans ranging from 
gastroenteritis to fatal septicemic disease; Bordetella pertussis - the cause of 
whooping cough; Brucella abortus - a cause of abortion and infertility in cattle and a 
condition known as undulant fever in humans; Vibrio cholerae - a cause of cholera; 
1 0 Clostridium tetani - a cause of tetanus; Bacillus anthracis - a cause of anthrax. 

Suitable eukaryotes include fungi, plants and animals. Preferred eukaryotes include 
animal or human parasites and plant pests. 

Suitable fungi include the animal pathogens including Candida albicans - a 
cause of thrush, Trichophyton spp. - a cause of ringworm in children, athlete's foot in 
1 5 adults . Other suitable fungi include .the plant-pathogens Phytophthora infestans, 
Plasmopara viticola, Peronospora spp., Saprolegnia spp., Erysiphe spp., 
Ceratocystis ulmi, Monilinia fructigena, Venturia inequalis, Claviceps purpurea, 
Diplocarpon rosae, Puccinia graminis, Ustilago avenae. 

Suitable animal parasites include Plasmodium spp., Trypanosoma spp., 
20 Giarda spp., Trichomonas spp. and Schistosoma spp. Other animal parasites include 
the various platyhelminth, nematode and annelid parasites. 

Suitable plant pests include insects, nematodes and molluscs such as slugs 
and snails. 

Suitable plants include monocotyledons and dicotyledons. 
25 Preferred organisms are those for which the entire genome is known and for 

which it may be possible to construct a high density oligonucleotide array covering 
the entire genome or all of the open reading frames. 

Essential and conditional essential genes, particularly essential genes are 
targets for drug discovery. That is, essential and conditional essential genes of 
3 0 bacteria and the polypeptides which they encode may represent targets for 
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antibacterial substances, for example. Similarly essential and conditional essential 
genes of fungi and eukaryotic parasites, pests and plants and the proteins which they 
encode may represent targets for fungicides, antiparasitics, pesticides and herbicides 
respectively. Fungicides may have both animal and plant applications. Additionally, 
5 conditional essential genes may represent targets for the generation of attenuated 
vaccines, particularly in the case of bacterial conditional essential genes. 

Furthermore, if a particular gene is essential or conditionally essential for a 
number of different bacteria, fungi, parasites, pests or plants, that gene and the 
polypeptide it encodes may represent a target for substances with a broad-spectrum 
10 of activity. 

An essential or conditional essential gene identified by one of the methods 
- described above using a transposon of the invention and the polypeptide which it 
encodes may be used in a method for identifying an inhibitor of transcription and/or 
translation of the gene and/or activity of the polypeptide encoded by the gene. Such 
15 a method may comprise identifying an essential or conditional essential gene using a 
method of the invention and then determining whether a test substance inhibits the 
transcription and/or translation of a gene thus identified or inhibits the activity of a 
polypeptide encoded by such a gene. 

Such a substance may be referred to as an inhibitor of an essential or 
20 conditional essential gene. Thus, an inhibitor of an essential or conditional essential 
■ gene is a substance which inhibits expression and/or translation of that essential gene 
and/or activity of the polypeptide encoded by that essential or conditional essential 
gene. 

Any suitable assay may be carried out to determine whether a test substance 
25 is an inhibitor of an essential or conditional essential gene. For example, the 
promoter of ah essential or conditional essential gene may be linked to a coding 
sequence for a reporter polypeptide. Such a construct may be contacted with a test 
substance under conditions in which, in the absence of the test substance expression 
of the reporter polypeptide would occur. This would allow the effect of the test 




substance on expression of the essential or conditional essential gene to be 
determined. 

Substances which inhibit translation of an essential or conditional essential 
gene may be isolated,, for example, by contacting the miRNA of the essential or 
5 conditional essential gene with a test" substance under conditions that would peiinit 
translation of the mRNA in the absence of the test substance. This would allow the 
effect of the test substance on translation of the essential or conditional essential gene 
to be determined. 

Substances which inhibit activity of a polypeptide encoded by the essential ' 
10 gene may be isolated, for example, by contacting the polypeptide with a substrate for 
the polypeptide and a test substance under conditions that would permit activity of 
the polypeptide in the absence of the test substance. This would allow the effect of 
the test substance on activity of the polypeptide encoded by the essential or 
conditional essential gene to be determined. 
15 Suitable control experiments can be carried out For example, a putative 

inhibitor should be tested for its activity against other promoters, mRNAs or 
polypeptides to discount the possibility that it is a general inhibitor of gene 
transcription, translation or polypeptide activity. 

Suitable test products which can be tested in the above assays include 
20 combinatorial libraries, defined chemical entities, peptide and peptide mimetics, 
oligonucleotides and natural product libraries, such as display (e.g. phage display 
libraries) and antibody products. Antibody products include monoclonal and 
polyclonal antibodies, single chain antibodies, chimaeric antibodies and CDR-grafted 
. antibodies. 

25 Typically, organic molecules will be screened, preferably small organic 

molecules which have a molecular weight of from 50 to 2500 daltons. Candidate 
products can be biomolecules including, saccharides, fatty acids, steroids, purines, 
pyrimi dines, derivatives, structural analogs or combinations thereof. Candidate 
agents are obtained from a wide variety of sources including libraries of synthetic or 

30 natural compounds. Known pharmacological agents may be subjected to directed or 
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random chemical modifications, such as acylation, alkylation, esterification, 
amidification, etc. to produce structural analogs. 

Test substances may be used in an initial screen of, for example, for example 
from 10 to 100 substances per reaction, and the substances of these batches which 

5 show inhibition or stimulation tested individually. Test substances may be used at a 
concentration of from InM to lOOOuM, preferably from luM to lOOuM, more 
preferably from luMto lOuM. Suitable test substances for inhibitors of essential or 
conditional essential genes include combinatorial libraries, defined chemical entities, 
peptides and peptide mimetics, oligonucleotides and natural product libraries. 

10 The test substances may be used in an initial screen of, for example, ten 

substances per reaction, and the substances of batches which show inhibition tested 
individually. 

An inhibitor of an essential or conditional essential gene is one which inhibits 
expression and/or translation of that essential gene and/or activity of Ihe polypeptide 

15 ' encoded by that essential or conditional gene. Preferred inhibitors of the invention 
are those which inhibit essential gene expression and/or translation and/or activity by 
at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at 
least 70%, at least 80%, at least 90%, at least 95% or at least 99% at a concentration 
of the inhibitor of 1 ngml- 1 , 10 ngn* 1 , 100 ngn* 1 , 500 ngml- 1 , 1 ugml- 1 , 10 ugml- 1 , 

20 100 ugml- 1 , 500 ugml-\ 1 mgml- 1 , 10 mgml- 1 , lOOmgrnl" 1 . The. percentage inhibition, 
represents the percentage decrease in expression and/or translation and/or activity in 
a comparison of assays in the presence and absence of the test substance. Any 
combination of the above mentioned degrees of percentage inhibition and 
concentration of inhibitor may be used to define an inhibitor of the invention, with 

25 greater inhibition at lower concentrations being preferred. 

Test substances which show activity in assays such as those described above 
. can be tested in in vivo systems, such as an animal model of infection for 
antibacterial activity or a plant model for herbicidal activity. Thus, candidate 
inhibitors could be tested for their ability to attenuate bacterial infections in mice in . 
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the case of an antibacterial or for their ability to inhibit growth of plants in the case of 
a herbicide. 

Inhibitors of bacterial, fungal or eukaryotic parasite essential or conditional 
essential genes may be used in a method of treatment of the human or animal body 
" 5 'by therapy. Tn pafti^lar sxfchTsu 

bacterial, fungal or eukaryotic parasite infection. Such substances may also be used 
for the manufacture of a medicament for use in the treatment of a bacterial, fungal or 
eukaryotic parasite infections The condition of a patient suffering from such an 
infection can be improved by administration of an inhibitor. A therapeutically 

1 0. effective amount of an inhibitor may be given to a human patient in need thereof. 
Inhibitors of bacterial, fungal or eukaryotic parasite essential or conditional essential 
genes may be administered in a variety of dosage forms. Thus, they can be 
administered orally, for example as tablets, troches, lozenges, aqueous or oily 
suspensions, dispersible powders or granules. The inhibitors may also be 

15 administered parenterally, either subcutaneously, intravenously, intramuscularly, 
intrasternally, trans dermally or by infusion techniques. The inhibitors may also be 
administered as suppositories. A physician will be able to determine the required 
• route of administration for each particular patient. 

The formulation of an inhibitor for use in preventing or treating a bacterial or 

20 fungal infection will depend upon factors such as the nature of the exact inhibitor, 
whether a pharmaceutical or veterinary use is intended, etc. An inhibitor may be 
formulated for simultaneous, separate or sequential use. 

An inhibitor is typically formulated for administration in the present 
invention with a pharmaceutically acceptable carrier or diluent. The pharmaceutical 

25 carrier or diluent may be, for example, an isotonic solution. For example, solid oral 
forms may contain, together with the active compound, diluents, e.g. lactose, 
dextrose, saccharose, cellulose, cornstarch or potato starch; lubricants, e.g. silica, 
talc, stearic acid, magnesium or calcium stearate, and/or polyethylene glycols; 
binding agents; e.g. starches, gum arabic, gelatin, methylcellulose, 

30 carboxymethylcellulose or polyvinyl pyrrolidone; disaggregating agents, e.g. starch, 
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alginic acid, alginates or sodium starch glycolate; effervescing mixtures; dyestuffs; 
sweeteners; wetting agents, such as lecithin, polysorbates, laurylsulphates; and, in 
general, non-toxic and pharmacologically inactive substances used in pharmaceutical 
formulations. Such pharmaceutical preparations may be manufactured in known 
5 manner, for example, by means of mixing, granulating, tabletting, sugar-coating, or 
film-coating processes. 

Liquid dispersions for oral administration may be syrups, emulsions or 
suspensions. The syrups may contain as carriers, for example, saccharose or 
saccharose with glycerine and/or mannitol and/or sorbitol. 

10 Suspensions and emulsions may contain as carrier, for example a natural 

gum, agar, sodium alginate, pectin, methylcellulose, carboxymethylcellulose, or 
polyvinyl alcohol. -The suspensions or solutions for intramuscular injections may 
contain, together with the active compound, a pharmaceutical^ acceptable carrier, 
e.g. sterile water, olive oil, ethyl oleate, glycols, e.g. propylene glycol, and if desired, 

15 a suitable amount of lidocaine hydrochloride. 

Solutions for intravenous administration or infusion may contain as carrier, 
for example, sterile water or preferably they may be in the form of sterile, aqueous, 
isotonic saline solutions. . 

A therapeutically effective amount of an inhibitor is administered to a patient. 

20 The dose of an inhibitor may be determined according to various parameters, 
especially according to the substance used; the age, weight and condition of the " 
patient to be treated; the route of administration; and the required regimen. Again, a 
physician will be able to determine the required route of administration and dosage 
for any particular patient. A typical daily dose is from about 0. 1 to 50 mg per kg of 

25 body weight, according to the activity of the specific inhibitor, the age, weight and 
conditions of the subject to be treated, the type and severity of the degeneration and 
the frequency and route of administration. Preferably, daily dosage levels are from 5 
mg to 2 g. 

Conditional essential genes are good candidates for. use in the preparation of 
30 live attenuated vaccines. The principle behind vaccination is to induce an immun e 
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response in the host thus providing protection against subsequent challenge with a 
pathogen. This may be achieved by inoculation with a live attenuated strain of the 
pathogen, i.e. a strain having reduced virulence such that it does not cause the disease 
caused by the virulent pathogen. Bacteria which carry mutations in conditional 
5 essential genes required for survival- (i.e. growu^rbUfefation) in a'host isolated 
according to the methods described above may be good candidates for use in live 

attenuated vaccines. 

Mutations introduced into a bacterium for use in a vaccine generally 

knock-out the function of the conditional essential gene, for example a gene required 
.10- : for growth/proliferation in a host, completely. This may be achieved either by 

abolishing synthesis of any polypeptide at all from the gene or by making a mutation 
. . that results in synthesis of non-functional polypeptide. In order to abolish synthesis 

of polypeptide, either the entire gene or its 5'-end may be deleted. A deletion or 

insertion within the coding sequence of a gene may be used to create a gene that 
15 synthases only non-functional polypeptide (e.g. polypeptide that contains only the 

N-terminal sequence of the wild-type protein). 

The bacterium may have mutations in one or more, for example two, three or • 

four conditional essential genes. The mutations are non-reverting mutations. These 

are mutations that show essentially no reversion back to the wild-type when the 
20 bacterium is used as a vaccine. Such mutations include insertions and deletions. 

Insertions and deletions are preferably large, typically at least 10 nucleotides in 

length, for example from 10 to 600 nucleotides. Preferably, the whole coding 

sequence is deleted. 

The bacterium used in the vaccine preferably contains only defined 
25' mutations, i.e. mutations which are characterised.. It is clearly undesirable to use a 
bacterium which has uncharacterised mutations in its genome as a vaccine because 
there would be a risk that the uncharacterised mutations may confer properties on the 
bacterium that cause undesirable side-effects. 

The attenuating mutations may be introduced by methods well known to 
30 those skilled in the art. Appropriate methods include cloning the DNA sequence of 
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the wild-type gene into a vector, e.g. a plasmid, and inserting a selectable marker into 
the cloned DNA sequence or deleting a part of the DNA sequence, resulting in its 
. inactivation. A deletion may be introduced by, for example, cutting the DNA 
sequence using restriction enzymes that cut at two points in or just outside the coding 
5 sequence and ligating together the two ends in the remaining sequence with an 

antibiotic resistance determinant. A plasmid. carrying the- inactivated DNA sequence 
can be transformed into the bacterium by known techniques such as electroporation 
or conjugation for example. It is then possible by suitable selection to identify a 
mutant wherein the inactivated DNA sequence has recombined into the chromosome 
10 of the bacterium and the wild-type DNA sequence has been rendered non-functional 
by homologous recombination. 

The attenuated bacterium of the invention may be genetically engineered to 
express an antigen that is not expressed by the native bacterium (a "heterologous 
antigen"), so that the attenuated bacterium acts as a carrier of the heterologous 
1 5 antigen. The antigen may be from another organism, so that the vaccine provides 
protection against the other organism. A multivalent vaccine may be produced which 
not only provides immunity against the virulent parent of the attenuated bacterium 
but also provides immunity against the other organism. Furthermore, the attenuated 
bacterium may be engineered to express more than one heterologous antigen, in 
20 which case the heterologous antigens may be from the same or different organisms. 
. The heterologous antigen may be a complete protein or a part of a protein containing 
an epitope. The antigen may be from a virus, prokaryote or a eukaryote, for example 
another bacterium, a yeast, a fungus or a eukaryotic parasite. The antigen may be 
from an extracellular or intracellular protein. More especially, the antigenic 
25 sequence may be from Rcoli, tetanus, hepatitis A, B or C virus, human rhinovirus 
such as type 2 or type 14, herpes simplex virus, poliovirus type 2 or 3, 
foot-and-mouth disease virus, influenza virus, coxsackie virus or Chlamydia 
. trachomatis. Useful antigens include non-toxic components of E.coli heat labile 
toxin, Kcoli K88 antigens, ETEC colonization factor antigens; P.69 protein from 
3 0 B. pertussis and tetanus toxin fragment C. 
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The DNA encoding the heterologous antigen is expressed from a promoter 
that is active in vivo. Two promoters that have been shown to work well in 
Salmonella are the nirB promoter and the htrA promoter. For expression of the 
ETEC colonization factor antigens, the wild-type promoters could be used. 
5 A- DNA construct comprising- the-promoter operably linked to DNA encoding the- 
heterologous antigen may be made and transformed into the attenuated bacterium 
using conventional techniques. Transformants containing the DNA construct may be 
selected, for example by screening for a selectable marker on the construct. Bacteria 
containing the construct may be grown in vitro before being formulated for 

1 0 administration to the host for vaccination purposes. 

The vaccine may be formulated using known techniques for formulating 
attenuated bacterial vaccines. The vaccine is advantageously presented for oral 
administration, for example in a lyophilised encapsulated form. Such capsules may 
be provided with an enteric coating comprising, for example, Eudragate "S H (Trade 

15 Mark), Eudragate "L" (Trade Mark), cellulose acetate, cellulose phthalate or 

hydroxypropylmethyl cellulose. These capsules may be used as such, or alternatively, 
the lyophilised material may be reconstituted prior to administration, e.g. as a 
suspension. Reconstitution is advantageously effected in a buffer at a suitable pH to 
• ensure the viability of the bacteria. In order to protect the attenuated bacteria and the 

20 vaccine from gastric acidity, a sodium bicarbonate preparation is advantageously 
administered before each administration of the vaccine. Alternatively, the vaccine 
may be prepared for parenteral administration, intranasal administration or 
intramuscular administration. 

The vaccine may be used in the vaccination of a mammalian host, particularly 

25 a human host but also an -animal host. An infection caused by a microorganism, 

especially a pathogen, may therefore be prevented by administering an effective dose 
of a vaccine prepared according to the invention. The dosage employed will 
ultimately be at the discretion of the physician, but will be dependent on various 
factors including the size and weight of the host and the type of vaccine formulated. 




However, a dosage comprising the oral administration of from 10 7 to 10 u bacteria 
per dose may be convenient for a 70 kg adult human host 

Inhibitor of bacterial, fungal and pest essential or conditional essential genes 
may be administered to plants in order to prevent or treat bacterial, fungal or pest 
5 infections; the term pest includes any animal which attacks a plant. Thus inhibitors 
of the invention may be useful as pesticides. Inhibitors of plant essential or 
conditional essential genes may be administered to plants in order to reduce or stop 
plant growth, that is to act as a herbicide. 

The inhibitors of the present invention are normally applied in the form of 
10 • compositions together with one or more "agriculturally acceptable carriers or diluents 
and can be applied to the crop area or plant to be treated, simultaneously or in 
succession with further compounds. 

The inhi bitors of the invention can be selective herbicides, bactericides, 
fungicides or pesticides or mixtures of several of these preparations, if desired 
15 together with further carriers, surfactants or application-promoting adjuvants 
customarily employed in the art of formulation. Suitable carriers and diluents 
correspond to substances ordinarily employed in formulation technology, e.g. natural 
or regenerated mineral substances, solvents, dispersants, wetting agents, tackifiers, 
binders or fertilizers. 

20 . A preferred method of applying active ingredients of the present invention or 

an agrochemical composition which contains at least one of the active ingredients is 
leaf application. The number of applications and the rate of application depend on 
the intensity of infestation by the pathogen. However, the active ingredients can also 
penetrate the plant through the roots via the soil (systemic action) by impregnating 

25 the locus of the plant with a liquid composition, or by applying the compounds in 
solid form to the soil, e.g. in granular form (soil application). The active ingredients 
may also be applied to seeds (coating) by impregnating the seeds either with a liquid 
formulation containing active ingredients, or coating them with a solid formulation. 
In special cases, further types of application are also possible, for example, selective 

3 0 . treatment of the plant stems or buds. 
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The active ingredients are used in unmodified form or, preferably, together 
with the adjuvants conventionally employed in the art of formulation, and are 
therefore fonnulated in known manner to emulsifiable concentrates, coatable pastes, . 
directly sprayabte or dilutable solutions, dilute emulsions, wettable powders, soluble 
5 powders, dusts~granulates, and also encapsulate^ 

substances. Like the nature of the compositions, the methods of application, such as 
spraying, atomizing, dusting, scattering or pouring, are chosen in accordance with the 
intended objectives and the prevailing circumstances. Advantageous rates of 
application are normally from 50g to 5kg of active ingredient (ai.) per hectare ("ha", 
_1Q, approximately 2.471 acres), preferably from lOOg to 2kg ai./ha, most preferably 
from 200g to 500g a.i./ha. 

The formulations, compositions or preparations containing the active 
ingredients and, where appropriate, a solid or liquid adjuvant, are prepared in known 
manner, for example by homogeneously mixing and/or grinding active ingredients 
15 with extenders, for example solvents, solid carriers and, where appropriate, 
surface-active compounds (surfactants). 

Suitable solvents include aromatic hydrocarbons, preferably the fractions 
having 8 to 12 carbon atoms, for example, xylene mixtures or substituted 
naphthalene, phthalate such as dibutyl phthalate or dioctyl phthalate, aliphatic 
20 hydrocarbons such. as cyclohexane or paraffins, alcohols and glycols and their ethers 
and esters, such as ethanol, ethylene glycol, monomethyl or monoethyl ether, ketones 
such as cyclohexanone, strongly polar solvents such as N-methyl-2-pyrrolidone, 
dimethyl sulfoxide or dimethyl fonnamide, as well as epoxidized vegetable oils such 
as epoxidized coconut oil or soybean oil; or water. 
25 The solid carriers used e.g. for dusts and dispersible powders, are normally 

• natural mineral fillers such as calcite, talcum, kaolin, montmorillonite or attapulgite. 
In order to improve the physical properties it is also possible to add highly dispersed 
silicic acid or highly dispersed absorbent polymers. Suitable granulated adsoxptive . 
carriers are porous types, for example pumice, broken brick, sepiolite or bentonite; 
30 and suitable nonsorbent carriers are materials such as calcite or sand. In addition, a 
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great number of pfegranulated materials of inorganic or organic nature can be used, 
e.g. especially dolomite or pulverized plant residues. 

Depending on the nature of the active ingredient to be used in the 
formulation, suitable surface-active compounds arenonionic, cationic and/or anionic 
5 surfactants having good emulsifying, dispersing, and wetting properties. The term 
"surfactants" will also be understood as comprising mixtures of surfactants. 
Suitable anionic surfactants can be both water-soluble soaps and water-soluble 
synthetic surface-active compounds. 

Suitable soaps are the alkali metal salts, alkaline earth metal salts or 
1 0 unsubstituted or substituted ammonium salts of higher fatty acids (chains of 1 0 to 22 
carbon atoms), for example the sodium or potassium salts of oleic or stearic acid, or 
of natural fatty acid mixtures which can be obtained for example from coconut oil or 
tallow oil. The fatty acid methyltaurin salts may also be used. 

More frequently, however, so-called synthetic surfactants are used, especially 
1 5 fatty sulfonates, fatty sulfates, sulfonated benzimidazole derivatives or 
. alkylarylsulfonates. 

The fatty sulfonates or sulfates are usually in the form of alkali metal salts, 
alkaline earth metal salts or unsubstituted or substituted ammoniums salts and have a 
8 to 22 carbon alkyl radical which also includes the alkyl moiety of alkyl radicals, for 
20 example, the sodium or calcium salt of lignonsulfonic acid, of dodecylsulf ate or of a 
mixture of fatty alcohol sulfates obtained from natural fatty acids. These compounds 
also comprise the salts of sulfuric acid esters and sulfonic acids of fatty 
alcohol/ethylene oxide adducts. The sulfonated benzimidazole derivatives preferably 
contain 2 sulfonic acid groups and one fatty acid radical containing 8 to 22 carbon 
25 atoms. Examples of alkylarylsulfonates are the sodium, calcium or triethanolamine 
salts of dodecylbenzenesulfonic acid, dibutylnaphthalenesulfonic acid, or of a . 
naphthalenesulfonic acid/formaldehyde condensation product. Also suitable are 
corresponding phosphates, e. g. salts of the phosphoric acid ester of an 
adduct of p-nonylphenol with 4 to 14 moles of ethylene oxide. 
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Non-ionic surfactants are preferably polyglycol ether derivatives of aliphatic 
or cycloaliphatic alcohols, or saturated or unsaturated fatty acids and alkylphenols, 
said derivatives containing 3 to 30 glycol ether groups and 8 to 20 carbon atoms in 
the (aliphatic) hydrocarbon moiety and 6 to 18 carbon atoms in the aikyi moiety of 
5 the -alkylphenols: 

Further suitable non-ionic surfactants are the water-soluble adducts of 
polyethylene oxide with polypropylene glycol, ethylenediamine propylene glycol 
. and alkylpolypropylene glycol containing 1 to 10 carbon atoms in the alkyl chain; 
which adducts contain 20 to 250 ethylene glycol ether groups and 10 to 100 
.10 propylene glycol ether groups. These compounds usually contain 1 to 5 ethylene 
glycol units per propylene glycol unit 

Representative examples of non-ionic surfactants are 
nonylphenolpolyethoxyethanols, castor oil polyglycol ethers, 
polypropylene/polyethylene oxide adducts, tributylphenoxypolyethoxyethanol, 
15 polyethylene glycol and octylphenoxyethoxyethanol. Fatty acid esters of 

polyoxyethylene sorbitan and polyoxyethylene sorbitan trioleate are also suitable 
non-ionic surfactants. 

Cationic surfactants are preferably quaternary ammonium salts which have, 
as N-substituent, at least one Cg-C^ alkyl radical and, as further substituents, lower 
20 unsubstituted or halogenated alkyl, benzyl or lower hydroxyalkyl radicals. The salts 
• are preferably in the form of halides, methylsulfates or ethylsulfates, e.g. 

steary Itrimetfaylarn m onium chloride or benzyldi(2-chloroethyl)ethylammonium 
bromide. • 

The surfactants customarily employed in the art of formulation are described, 
25 for example, in "McCutcheon's Detergents and Emulsifiers Annual", MC Publishing 
Corp. Ringwood, New Jersey, 1979, and Sisely and Wood, "Encyclopaedia of 
Surface Active Agents," Chemical Publishing Co., Inc. New York, 1980. 
The agrochemical compositions usually contain from about 0. 1 to about 99% 
preferably about 0.1 to about 95%, and most preferably from about 3 to about 90% of 
30 the active ingredient, from about 1 to about 99.9%, preferably from about 1 to 99%, 
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and most preferably from about 5 to about 95% of a solid or liquid adjuvant, and 
from about 0 to about 25%, preferably about 0. 1 to about 25%, and most preferably 
from about 0. 1 to about 20% of a surfactant 

Whereas commercial products' are preferably formulated as concentrates, the 
5 end user will normally' employ dilute formulations. 

The following Examples illustrate the invention: 

». 

Examples 



Materials and Methods 

Unless indicated otherwise, the methods used are standard biochemical 
techniques. Examples of suitable general methodology textbooks include Sambrook 
et al., Molecular Cloning, a Laboratory Manual (1989) and Ausubel et at, Current 
1 5 Protocols in Molecular Biology (1 995), John Wiley & Sons, Inc. 

E. coli strains were grown on L-Agar (Sigma) and in L-Broth (Sigma). 
Kanamycin was purchased from Sigma. £ typhimurium SL1344 was grown in 
Tryptic Soy Broth (TSB, Oxoid). 

20 Example 1. Construction of a G ene-Kellv transposon 

The EZ:Tn R6k ori Kan transposon (Epicentre) was used as a PCR template 
with oligonucleotides: 
' 97(SEQIDNO:2) 

25 S'-CAC&TUJXJTCTTATACACATCTCCCTATAGTGAGTCGTAT TACCCATAA 
T ACCCATA ATAGCTGTTTGCCAg tcgactctagagg-3 ' ; and 
98 (SEQ JD NO: 3) 

' 5 , -CAGCTGTCTCTTATACACATCTCTTCTATAG'TGTCACCTAA ATAGGGAT 
A ACAGGGTAAT Gaattcgttaatacagatgt-3 ' . 
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ME are italicised, the RNA polymerase binding sites are in bold, and the 
•* homing endonuclease sites are underlined. These oligonucleotides incorporate the 
T7 RNA polymerase site with the homing endonuclease site Yl-Psp\ and the SP6 
RNA polymerase site with the homing endonuclease site for I-Scel, respectively. 
• 5 ECR was carried out accxTf dm 

A first round of PCR, to introduce the homing endonuclease and RNA 
polymerase sites, was carried out with an initial denaturation step at 96°C for 3 min, 
then 5 cycles of 96°C 30s, 25°C 90s, 72°C 2min 30s, then a further 25 cycles at 96°C 
30s, 50°C 90s, 72°C 2min 30s, final extension of 4 minutes and then cooled to 4°C. 
1 0 The PCR product was separated by gel electrophoresis, and the band (corresponding 
to the expected product at 2044bp) cut out and Gel extracted according to the Qiagen 
gel extraction protocol. The PCR products were re-ligated (according to Gibco 
protocol for T4 DNA ligase) overnight at 16°C. The ligation mixture was then 
purified using the Qiagen Gel Extraction protocol and the -circularised transposon 

15 eluted in 50 ^1 water. 1 p.1 of the Transposon DNA was then electroporated into 40 \il 
TransforMax™ EC100D™ /?/> + Electrocompetent£l coli (Epicentre; using 0.1 cm 
cuvettes at 200£2, 25 jiF and 20KV/cm), outgrowth was in 1 ml SOC medium (Gibco 
life technologies) for 1 h at 37°C. The transformation was then plated out on L-Agar 
(Sigma) plates containing Kanamycin at 30 jig/ml and incubated overnight at 37°C. 

20 12 colonies were then picked and grown up overnight in LB -Kan 30 ^ig/ml and 1.5 
ml of culture was used for DNA-mini-preps according to the Qiagen protocol. The 
DNA sequence of the resulting constructs was then determined on a Beckman CEQ 
DNA sequencer, (using manufacturers recommended conditions) with kan-2-FP-l 
forward primer (Epicentre) as a DNA sequencing primer. The sequence of clone 

25 1 1 (GK1 1) was found to contain a single base pair change in one of the ME 

sequences. To correct this sequence change, an experiment using a second round of 
• PGR was designed with an oligonucleotide that corrected the single base pair change. 
The clone GK1 1 (harbouring a single-point mutation in the ME) was re-PCR'd using 
un-phosphorylated ME primers (96 6 C for 3min, followed by 30 cycles of 96°C for . 

3 0 30s, 45°C for 90s, 72°C for 1 50s). the PCR products were then cloned into 
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pBAD-TOPO (Invitrogen) and transformed into E. coli TOP 10 chemically competent 
cells according to the Invitrogen protocol. The transformation mix was plated out on 
L-Agar containing kanamycin at 30 fig/ml and incubated overnight at 37°C. 6 
colonies were then picked and grown in L-Broth containing kanamycin at 30 jig/ml 
5 overnight at 37°C. DNA from 1 .5 mL samples of these cultures was then prepared 
using the miniprep method (Qiagen QIAprep spin miniprep kit) according to the 
Qiagen protocol. The DNA sequences of the inserts from each of these 6 minipreps 
was then determined with oligonucleotides kan-2-FP-l and R6kan-2-RP-l 
(Epicentre) as sequencing primers according to Beckman CEQ protocol. One of the 
10 isolates, clone 6 A was correct when sequenced, named pBAD-GK6A, and was 
s elected for further study. 

Example 2. Evaluation of the Gene-Kelly transposon 

(i) 17 and SP6 RNA polymerase evaluation 

15 In order to evaluate the novel transposon, 0.2 ^ig of pBAD-GK6A was used 

as template in an in vitro transcription (TVT) reactions using both SP6 (SP6 
Megascript kit; Ambion) and T7 (T7 Megashortscipt kit, Ambion) RNA 
. polymerases, according to Ambion protocols. RNA produced was purified using the 
Qiagen RNeasy mini Kit, and the amount of RNA produced measured at A 26 o- 

20 Transcription was observed both with the SP6 and T7 IVT reactions. 

(ii) Restriction digest of transposon DNA with homing endonucleases 

In order to determine whether or not the homing endonuclease sites were 
functional in the GK transposon, plasmid DNA (0.5 |ng) was digested with I-Scel and 
25 PI-PipL Separation of the resulting DNA products by agarose gel electrophoresis 
showed a single linearised band of the correct size. 

(iii) Transposition of Salmonella tvphimurium 

pBAD-GK6A (0.1 |ig) was electroporated into 40 \il & typhimurium SL1344 
30 electrocompetent cells (SL1344 was grown to an OD 0.5 in 100 ml tryptic-soy broth 
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(TSB from Oxoid) at 37°C. Cells were centrifuged at 5000 x g for 10 min at 4°C, 
washed three times in 50 ml 10% glycerol before a final re-suspension in 1 ml of 
10% glycerol) using a 0.2 cm cuvette, 200Q, 25 jiF and 12kV/cm, outgrowth was in 
1 ml SOC (Gibco) at 37°C for 1 h, the transformation was then plated onto L-Agar 
5 plates-contaiiiingrKanamycin at 50 jlg/nd" plates and grown ovemigjit at 37°C. A 
colony was then picked and grown in 2.5 litres L-Broth containing Kanamycin at 50 
jig/ml overnight at 37°C. A Qiagen Qiafilter Mega plasmid kit was used to purify 
pBAD-GK6A. 

10 Example 3. Generation of S. typhimurium SL1344 mutants with Transposon 
GK6a 

The transposome complex was generated from pBAD-GK6A as follows. 
pBAD-GK6a (100 jig) of was. digested with Xmnl and Ncol (NEB) in NEB buffer 2 
(with 1 jig BSA/ ml), overnight at 37°C. The entire digest was then run out on a 

15 0. 8% agarose gel, the ~2kb band, the GK transposon, was gel extracted using the 

Qiagen gel extraction kit, and eluted from 1 spin column in 50 \A TE pH8.5. The GK 
transposome complex was generated according to Epicentre protocols and 
electroporated into electrocompetent £ typhimurium SL1344 cells as described 
above. Following outgrowth the cells were subsequently plated out on Tryptic Soy 

20 Agar plates (TSA, Oxoid) containing Kanamycin at 50 |ig/ml and incubated 

overnight at 37°C. A total of 480 mutants were picked and grown overnight in 2 ml 
TSB (Oxoid) containing Kanamycin at 50 |ig/ml and glycerol stocks made (20% 
glycerol TSB). 

25 Example 4. Recovery of transposon for sequencing fuse of R6k origin of 
replication) 

Transposons and adjacent flanking DNA, corresponding to genes that have 
been disrupted by transposon insertion can be recovered from mutant chromosomal 
DNA samples using the use of R6k origin of replication. Digestion of chromosomal 
30 DNA purified from S. typhimurium SL1344 GK mutants with a restriction enzyme 
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that does not cut in the transposon, foUowed by circularising the fragments, and 
transformation into a^r + strain of K coli, results in the "rescuing" of this DNA. 

Mutants 1-50 were grown up individuaUy in 2 ml TSB containing Kanamycin 
at 50 ug/ml overnight at 37°C. Samples (1.5ml) was used to prepare chromosomal 

5 DNA using the Qiagen DNeasy tissue kit A total of 5 pi (0.5 pg) of each of me fifty 
chromosomal preps was digested with EcoRV (NEB) in a final volume of 20 pi, 
overnight at 37»C.' The EcoRV was heat inactivated at 80°C for 20 minutes. The 20 
pi digest was then religated in 100 pi final volume using Gibco T4 DNA ligase, 48 
hours at 4°C. Each religation was then individuaUy cleaned up using a Qiagen gel 

10 extraction spin column and eluted in 50 pi water. Ligations (4 pi, 0.04 pg) were 
electroporated into electrocompetent^ K coli (EC100D, Epicentre) according to 
Epicentre protocols and plated on L-Agar containing Kanamycin at 30 pg/ml and 
' incubated overnight at 37°C. Colonies were obtained from 46 of the electroporations, 
and were subsequently grown up in 5 ml L-Broth containing Kanamycin at 30 pg/ml 

15 37°C overnight. Plasmid DNA (2 pg) from these clones was sequenced according to 
the Beckman CEQ protocol using ohgonucleotide 108 as a sequencing primer (T7 
end of the transposon). 

FTnr . r ,. g ^»^Hnn nf RNA n.n - offs ..sin g iPCP ,nd TVT for target 

20 h yhridisati ™ mirroarravs 

The generation of labelled target from GK mutants can be achieved by 

inverse PCR (iPCR) amplification of each end of the transposon followed by WT 
reactions using either SP6 or T7 RNA polymerises. A pool of 96 £ typhimurium 
SL1344 mutants was inoculated into L-Broth (10 ml) containing Kanamycin at 50 

25 - pg/ml and grown overnight at 37°C. statically. Chromosomal DNA (20 pg) was 
prepared from 1 .5ml of culture using the Qiagen DNeasy Kit, and 5 pi .(0.5 pg) 
digested individuaUy with the restriction enzymes HaeJH, Hhal, Hpych4TV and Rsal 
(NEB) in their respective NEB buffers in a final volume of 20 pL overnight at 37°C. 
The enzymes were then heat denatured at 65»C for JM, Hpych4 IV and Rsal and 

30 80°Cfor#a e mfor20min. Each 20 pi digestwas then self-Ugated with T4 DNA 
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ligase (Gibco) in a 100 \il reaction at 4°C for 48 h. Amplification of the DNA 
flanking each end of the transposon was achieved by iPCR. iPCR reactions to 
amplify the SP6 end of the transposon are performed with: 
oligonucleotide 107 (SEQ ID NO: 4) 
5 S^eTAeeCTGTGGAACACCTACATCT-r; 
and one of either 

oligonucleotide 115 (SEQ ID NO: 5) 

5 , -ATTACCTCTTTCTCCGCACCCGAC-3 , ; Rsal or Hpych4SV or 

oligonucleotide 1 16 (SEQ ID NO: 6) 
10 . . 5 ' -CGACATAGATCCGGAAC ATAATGG-3 ' ; HaeUI oxHhal, 

depending on the restriction enzyme used (in brackets) to cut the chromosomal DNA. 

iPCR reactions to amplify the T7 end of the transposon were performed with 

oligonucleotide 108 (SEQ ID NO: 7) 

5 ' - ACCTACAACAAAGCTCTCATCAACC -3 ' 
15 and one of either 

* oligonucleotide 1 17 (SEQ ID NO: 8) 

5'- ACAACCTATTAATTTCCCCTCGTC -3'; JRsal, HaeSl ovHhal or 

oligonucleotidel 1 8 (SEQ ID NO: 9) 

5 ' - ATGTTGGAATTTAATCGCGGCCTC -3 5 ; Hpych4JV, 

20 depending on the restriction enzyme used (in brackets) to cut the chromosomal DNA. 
iPCR reactions were then set upjusing Qiagen Taq polymerase according to the 
Qiagen protocol. 4 ^1 (0.02 ^ig) of ligation was used as template for each iPCR 
reaction. The reactions were initially denatured at 94°C 3 * followed by 30 cycles of . 
94°C 30s, 65°C 90s, 72°C 90s followed by 7min extension at 72°C and then cooling 

25 to 4°C. Each of the 8 iPCR's were purified using a Qiagen Gel extraction kit and the 
DNA eluted in 5 p.1 water. Each iPCR product was then re-digested with its 
respective restriction enzyme in a final volume of 50 jil (NEB) overnight 37°C. The 
digests were then cleaned using a Qiagen Gel extraction kit (following the 
manufacturers recommended procedure) and the DNA eluted in 50 fil EB pH8.5. 
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Each digested iPCR product (2 jil) was used as a template for both T7 and 
SP6 in vitro transcription reactions according to the Ambion protocol. The RNA was 
' cleaned using the Qiagen Rneasy kit and eluted in 50 \il of RNase free water that was 
then placed in a UV transparent 96 well piate and the absorbance at 260 nm 
5 measured. 

• Example 6. Ligation capture recovery of Gene Kelly transposon ends 

A significant advantage of the Gene Kelly (GK) transposon is that it permits 
the recovery of DNA fragments adjacent to the site of transposon insertion by a 

10 method that does not employ PGR, which is ligation capture. . Essentially, because of 
the rarity of the L-Scel and T?L-PspL homing endonuclease sites the T7 and SP6 
promoter sites that are linked to these sites can be enriched from a pool of DNA by 
ligation of a biotinylated link er to the cut site followed by purification using 
Streptavidin linked magnetic beads. A ligation capture experiment was performed on 

1 5 pB AD-GK6A. Plasmid DNA (1 jig) was digested with PI-P^spI overnight and the 
resulting linearised DNA purified using a Qiagen Gel extraction kit This DNA (400 
ng) was then digested overnight with HaeUL and subsequently dephosphorylated 
using Calf Intestinal Alkaline Phosphatase (Roche). Dephosphorylated DNA was 
then ligated overnight onto a biotinylated linker, generated by annealing 

20 oligonucleotide 113 (SEQ ID NO: 10) 

5 ' -biotin-GACGACCTCAGTTACGGTACGATCGGCCACGTAGCTTAT-3 ' 

. and oligonucleotide 1 14 (SEQ ID NO: 1 1) 
5 '-phosphate-GCTACGTGGCCGATCGTACCGTAACTGAGGTCGTC-3 ' . 

The ligation was purified using a Qiagen Gel extraction. Biotinylated DNA 
25 was then extracted from the ligation using Streptavidin-linked magnetic particles 
(150 fig; Promega) according to the manufacturers protocol, and the beads finally 
resuspended in 8 (il of Ik HaeUL restriction buffer containing 10 units of HaeUL 
(NEB). The digestion was then incubated at 37°C for 2 hours to remove the T7 RNA 
promoter from the linker. The beads were removed and an IVT reaction performed 
30 on the supernatant using an Ambion 17 Megashortscript kit, according to the 
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manufacturers instructions. IVT products were purified using a Qiagen RNeasy kit, 
and the products eluted in 50 ul water and read at Amo- 

From the sequence data obtained we were able to identify the transposon 
insertion point in the published Salmonella genome LT2- Figure 4 shows a graph 
5 showing me random destitution of the sites orGKtransposon. distribution in the - 
LT2 genome for the 46 sequenced mutants . 

Tem ple. 7. Identification of g e ™« in Salmonella typhimurium important for 
virulence in the mou se model of infection 

10 . • ' 480 1 random Gene Kelly transposon mutants were generated in Salmonella 
typhimurium SL1344. The mutants were grown individually in 1ml of L-broth in 96- 
wellHTS plates, overnight and subsequently pooled. Chromosomal DNA (78.6 ug) . 
was purified and subsequently restriction digested separately with HaeUl audita* 
and the products purified with Qiagen gel extraction columns. The DNA was 
15 divided in half and separate IVT reactions (19.6ug DNA template/TVT) were 
performed with the respective RNA polymerase (SP6 and TV) resulting in the 
generation of labelled target (SP6 and T7 target was labelled with Cy5 and Cy3, 
respectively) corresponding to the DNA flanking both ends of transposon insertions. 
DNA microarrays were designed based on the entire Salmonella typhimurium 
20 LT2 genome sequence, with probes synthesised in both the sense and anti-sense 
directions. Following hybridisation, the data were extracted from the arrays and 
analysed to identify those genes disrupted by transposon insertion within the pool of 
' 480 mutants. New microarrays were designed incorporating probes corresponding to 
each site of transposon insertion. 
25 The mutants were grown individually in lml of L-broth in 96-wellHTS 

plates, overnight. Cultures were pooled ('Input pool') and chromosomal DNA 
purifiedfromSOmlofmeresmtant^^ An inoculum of 10 5 ^ 

cfu / ml PBS was generated from the 'input pool'. Mice were inoculated i.v. with 10 s 
cfu of the 'input pool' and the infection allowed to proceed for 2.5 days, whereupon 
30 the mice were sacrificed and liver and spleen removed. Organs were pulverised in 
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• 10ml water and spread onto 4 x 120 mm diameter L-Agar plates and incubated 
overnight at 37°C. Bacteria ('output pool') were harvested from these plates by re- 
suspending the bacterial lawns in 10 ml L-broth / plate. Chromosomal DNA ('output 
pool DNA 5 ) was purified from 3 ml of this suspension 
5 'Input' and 'output pool' DNA (5ug) was digested overnight with the 

restriction endonucleaseifral, and subsequently cleaned on a Qiagen Gel extraction 
column. In vitro transcription reactions were set up using the Ambion 17 and SP6 
Megascriptkits, using 2ug of digested DNA as template, with Cy3-CTP and Cy5- 
CTP, respectively, and incubated at 37°C for 24hrs. Following the DNasel step RNA 

1 0 was purified using a Qiagen nucleotide removal kit. The KNA from both the SP6 
and T7 IVT reactions were then hybridised overnight to a DNA microarray 
containing probes corresponding to each transposon insertion site. Arrays were 
washed in 0.6xSSPE X followed by 0.06XSSPE 0.18%PEG200, each wash lasting 
5min. Slides were then dried and analysed using an Agilent microarray scanner . 

1 5 Several mutants in the pool of 96 were analysed using DNA sequencing to ascertain 
the precise point of transposition within the SL1344 chromosome. One mutant was 
characterised as containing a transposon within the aroK gene that would lead to the 
loss of its function. AroA. mutants are one of the best genetically defined 
• S. typhimurium vaccine strains (Chatfield etal, 1992; Micrbb. Pathog. 12: 145-151). 

20 These mutants are reduced in their ability to survive within susceptible mice 
compared to its wild type parent strain, such that 3 days post infection bacterial 
levels are reduced 1000-fold. 

Data from the microarray images were extracted using Agilent's image 
analysis software and the data fed into a data viewing package. Array data from the 

25 input and output hybridisations were compared (see Figure 7). 

Analysis of both the array image and the extracted data from the input pool 
reveal that target was generated to the aroA gene and that this target hybridised to the 
expected probes surrounding the site of transposon insertion. Analysis of the 
corresponding data- from all three output pools revealed that significantly less target 

3 0 was hybridised to the probes corresponding to the ctroA gene compared to two 
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control mutants with transposons disrupting other loci (gene X and an intergenic 
region) in the SL1344. genome. These data indicate that the aroA mutant is 
attenuated in its ability to survive within the mice. Therefore this technique allows 
the identification of genes that are important for the virulence of S. typhimurium in 
- -5- the mouse-model: of infection. 
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SEQUENCE LISTING 

<110> ARROW THERAPEUTICS LIMITED 
<120> TRANSPOSON 
<130> P.84806A GCW 
<160> 11 
<170> Patentln version 3.0 

<210> 1 

* <211> 2044 
15 <212> DNA 

<213> Artificial transposon 

<400> 1 

ctgtctctta tacacatctc ttctatagtg tcacctaaat agggataaca gggtaatgaa 60 

20 ttcgttaata cagatgtagg tgttccacag ggtagccagc agcatcctgc gatgcagatc 120 

cggatgccat ttcattacct ctttctccgc acccgacata gatccgaaga tcagcagttc 180 

aacctgttga tagtacgtac taagctctca tgtttca'cgt actaagctct catgtttaac 240 

gtactaagct ctcatgttta acgaactaaa ccctcatggc taacgtacta agctctcatg 300 

gctaacgtac taagctctca tgtttcacgt actaagctct catgtttgaa caataaaatt 360 

25 • aatataaatc agcaacttaa atagcctcta aggttttaag ttttataaga aaaaaaagaa 420 
tatataaggc ttttaaagct tttaaggttt aacggttgtg gacaacaagc cagggatctg . 480 

ccatttcatt acctctttct ccgcacccga catagatccg gaacataatg gtgcagggcg 540 

ctgacttccg cgtttccaga ctttacgaaa cacggaaacc gaagaccatt catgttgttg 600 

ctcaggtcgc agacgttttg cagcagcagt cgcttcacgt tcgctcgcgt atcggtgatt 660 

30 cattctgcta accagtaagg caaccccgcc agcctagccg ggtcctcaac gacaggagca 720 

cgatcatgcg cacccgtggc caggacccaa cgctgcccga gatgcgccgc gtgcggctgc 780 

tggagatggc ggacgcgatg gatatgttct gccaagggtt ggtttgcgca ttcacagggt 840 

gtctcaaaat ctctgatgtt acattgcaca agataaaaat atatcatcat gaacaataaa 900 

actgtctgct tacataaaca gtaatacaag gggtgttatg- agccatattc aacgggaaac 960 

35 gtcttgctcg aggccgcgat taaattccaa catggatgct gatttatatg ggtataaatg 1020 

* ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgattgtatg ggaagcccga 1080 
tgcgccagag ttgtttctga aacatggcaa-aggtagcgtt gccaatgatg ttacagatga 1140* 
gatggtcaga ctaaactggc tgacggaatt tatgcctctt ccgaccatca agcattttat 1200 
ccgtactcct gatgatgcat ggttactcac cactgcgatc cccggaaaaa cagcattcca 1260 

40 ggtattagaa gaatatcctg attcaggtga aaatattgtt gatgcgctgg cagtgttcct 1320 

gcgccggttg cattcgattc ctgtttgtaa ttgtcctttt aacagcgatc gcgtatttcg 1380 

tctcgctcag gcgcaatcac gaatgaataa cggtttggtt gatgcgagtg attttgatga 1440 

cgagcgtaat. ggctggcctg ttgaacaagt ctggaaagaa atgcataaac ttttgccatt 1500 

ctcaccggat tcagtcgtca ctcatggtga tttctcactt gataacctta tttttgacga 1560 

45 ggggaaatta ataggttgta ttgatgttgg acgagtcgga atcgcagacc gataccagga 1620 

tcttgccatc ctatggaact gcctcggtga gttttctcct tcattacaga aacggctttt 1680 

tcaaaaatat ggtattgata atcctgatat gaataaattg cagtttcatt tgatgctcga 1740 

tgagtttttc taatcagaat tggttaattg gttgtaacac tggcagagca ttacgctgac 1800 

ttgacgggac ggcggctttg ttgaataaat cgaacttttg ctgagttgaa ggatcagatc 1860 

50 acgcatcttc ccgacaacgc agaccgttcc gtggcaaagc aaaagttcaa aatcaccaac 1920 

tggtccacct acaacaaagc tctcatcaac cgtggcgggg atcctctaga gtcgactggc 1980 

aaacagctat tatgggtatt atgggtaata cgactcacta tagggagatg tgtataagag 2040 

acag ^ 2044 
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<210> 2 
<211> 85 
<212> DNA 

<213> Artificial oligonucleotide 

5 

<400> 2 

cagctgtctc ttatacacat ctccctatag tgagtcgtat tacccataat acccataata 60 
-gctgtttgcc agtcgactct - agagg- - -- 85- 

10 <210> 3 
<211> 80 
<212> DNA 

<213> Artificial oligonucleotide 
15 <400> 3 

cagctgtctc ttatacacat ctcttctata gtgtcaccta aatagggata acagggtaat 60 
gaattcgtta atacagatgt 80 

<210> 4 

20' <211> 24 

• * <212> DNA 

<213> Artificial oligonucleotide 

<400> 4 

25 ctaccctgtg gaacacctac atct ' 24 

<210> 5 

<211> 24 

<212> DNA 

30 <213> Artificial oligonucleotide 

<400> 5 

attacctctt tctccgcacc cgac 24 

35 <210> 6 
<211> 24 
<212> DNA 

<213> Artificial oligonucleotide 
40 <400> 6 

cgacatagat ccggaacata atgg 24 

<210> 7 

<211> 25 

45 <212> DNA 

<213> Artificial oligonucleotide 

<400> 7 

acctacaaca aagctctcat caacc ca 
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<210> 8 
<211> 24 
■ <212> DNA 

<213> Artificial oligonucleotide 

5 

<400> 8 

acaacctatt aatttcccct cgtc 

<210> 9 

10 <211> 24 

<212> DNA 

<213> Artificial oligonucleotide 

<400> 9 
15 atgttggaat ttaatcgcgg cctc 

<210> 10 

" <2il> 39 

<212> DNA 

20 <213> Artificial oligonucleotide 

r 

<400> 10 

gacgacctca gttacggtac gatcggccac gtagcttat 

25. <210> 11 

<211> 35 

<212> DNA 

<213> Artificial oligonucleotide 



30 



<400> 11 

gctacgtggc cgatcgtacc gtaactgagg tcgte 
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CLABMS 

1 . A transposon which comprises an RNA polymerase recognition site and a 
homing endonuclease recognition site. 

5 - 

2. A transposon according to claim 1 which comprises two RNA polymerase 
. recognition sites. 

3 . A transposon according to claim 2, wherein the two RNA polymerase 
10 . . recognition sites are diverse. 

4. A transposon according to claim 3, wherein the two diverse RNA polymerase 
recognition sites are two of a 17 RNA polymerase recognition site, an SP6 RNA 
polymerase recognition site or a T3 RNA polymerase recognition site. 

15 

.5. A transposon according to any one of the preceding claijns which comprises 
two homing endonuclease recognition sites. 

6. A transposon according to claim 5, wherein the two homing endonuclease 
20 recognition sites are diverse. 

7. A transposon according to claim 6, wherein the two diverse homing 
endonuclease recognition sites are an IScel recognition site and a l>I~Pspl 
recognition site. 

25 

8. A transposon according to any one of the preceding claims which further 
comprises a bacterial origin of replication. 

9. A transposon according to any one of the preceding claims which is a 
30 modified Tn5 transposon. 
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10. Use of a transposon according to any one of the preceding claims in a method 
for the identification of an essential or a conditional essential gene. 

11. A method for identifying an essential gene of an organism, which method 
5 comprises: 

(i) providing a library of transposon insertion mutants of the said 
organism, wherein the transposon is a transposon according to any one of claims 1 to 
10; 

(ii) isolating chromosomal DNA from the library of (i); 

10 (in) digesting the chromosomal DNA with a restriction endonuclease that 

is capable of cutting 5* to the RNA polymerase recognition site(s) in the transposon 
and 3' to the RNA polymerase recognition site(s) in the chromosomal DNA flanking 
the transposon insertion site; ; 

(iv) transcribing the resulting digested DNA from the RNA polymerase 

1 5 recognition site(s) in the said DNA 

(v) hybridising the resulting RNA with an oligonucleotide array; and 

(vi) identifying a probe on the oligonucleotide array which corresponds to 
an essential gene of the organism. 

20 12. A method according to claim 1 1, wherein a labelled ribonucleotide is present 
• when transcribing the digested DNA in step (iv). 

13 . A method according to claim 1 1, wherein step (v) is replaced by: 
(v)' reverse transcribing the resulting RNA; and 

25 (v)" hybridising the resulting cDNA with an oligonucleotide array. 

14. A method according to claim 13, wherein a labelled deoxyribonucleotide is 
present when reverse transcribing the RNA in step (v)' . 



30 15. A method according to claim 1 1 or 12, wherein: 
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(a) the transposon is a transposon according to any one of claims 3 to 9; 

(b) step (iv) is carried out by transcribing one aliquot of the digested 
DNA with a first RNA polymerase and transcribing a second aliquot of the digested 
DNA with a second different RNA polymerase; and 

5 (c)- step (v) is-camed-out-by^ 

with the same oligonucleotide array or separately with two copies of the same 
oligonucleotide array. 

16. A method according.to claim 15, wherein in step (b) the two aliquots of 
10 digested DNA are each transcribed in the presence of a different labelled 

ribonucleotide. 

17. A method according to claim 13 or 14, wherein: 

(a) the transposon is a transposon according to any one of claims 3 to 9; 
1 5 (b) step (iv) is carried out by transcribing one aliquot of the digested 

DNA with a first RNA polymerase and transcribing a second aliquot of the digested 
DNA with a second different RNA polymerase; and 

(c) step (v)" is carried out by hybridising the two resulting cDNA pools 
with the same oligonucleotide array or separately with two copies of the same 

20 oligonucleotide array. 

18. A method according to claim 17, wherein the two aliquots of RNA resulting 
from step (b) are each reverse transcribed using a different labelled 
deoxyribonucleotide. 

25 

19. A method according to any one of claims 1 1 to 18, wherein: 

(a) aliquots of the chromosomal DNA are digested separately with 
different restriction endonucleases in step (iii); 

(b) each of the restriction endonucleases is capable of cutting 5' to the 
30 RNA polymerase recognition site(s) in the transposon and 3' to the RNA polymerase 
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recognition site(s) in the chromosomal DNA flanking the transposon insertion site; 
and 

(c) v each aliquot is subsequently treated separately in steps (iv) to (vi). 

5 20. A method according to claim 1 9, wherein two or three aliquots of the 
chromosomal DNA are each digested separately with different restriction 
endonucleases. 

21. A method according to any one of claims 1 1 to 20, wherein step (iii) is 
10 replaced by: 

(iii)' digesting the chromosomal DNA with a homing endonuclease which 
is capable of cutting 5' to RNA polymerase recognition site(s) in the transposon; 

(iii)" digesting the chromosomal DNA with a restriction endonuclease that 
is capable of cutting 3' to the RNA polymerase recognition site(s) in the 
1 5 chromosomal DNA flanking the transposon insertion site; and 

(iii)' " ligating the digested DNA with a biotinylated linker; and 

(iii)"" recovering the digested DNA using streptavidin-coated particles. 

22. A method for identifying a conditional essential gene of an organism, which 
20' method comprises: 

(a) providing a first sample of a library of transposon insertion mutants of 
the said organism (input library); 

(b) providing a second sample of the library and subjecting that sample to 
a conditional restraint; 

25 (c) collecting the mutants that survive the conditional restraint in step (ii) 

to give a second library (output library); 

(d) carrying out a method according to steps (ii) to (iv) of any one of 

• claims 11, 12, 15, 16 or 19 to 21 on the input library from step (a) and on the output 
library from step (c); 

3 0 (e) hybridising the transcrib ed RNA derived from the input library and . 
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from the output library separately to copies of the same oligonucleotide array or, if 
the RNA derived from the two libraries is differentially labelled, to the same 
oligonucleotide array; and 

(f) identifying a probe on the oligonucleotide array (s) which corresponds 
5 to a-conditional-essential-gene of-the organism. 

23 . A method for identifying a conditional essential gene of an organism, which 
method comprises: 

(a) providing a first sample of a library of transposon insertion mutants of 
1 0 the said organism (input library); 

(b) providing a second sample of the library and subjecting that sample to 
a conditional restraint; 

(c) collecting the mutants that survive the conditional restraint in step (ii) 
to give a second library (output library); 

15 (d) carrying out a method according to steps (ii) to (v)' of any one of 

claims 13, 14, 17, 18 or 19 to 21 on the input library from step (a) and on the output 
library from step (c); 

(e) hybridising the reverse transcribed cDNA derived from the input 

* library and from the output library separately to copies of the same oligonucleotide 
20 array or, if the .cDNA derived from the two libraries is differentially labelled, to the 
' same oligonucleotide array; and 

(f) identifying a probe on the oligonucleotide array(s) which corresponds 
to a conditional essential gene of the organism. 

25 24. A method according to claim 22 or 23, wherein the organism is a bacterium 
and the conditional restraint is growth of that bacterium in its host. 

25 . A method according to any one of claims 1 1 to 24, wherein the 
oligonucleotide array comprises probes which are from 9 to 150 bp in length and/or 
30 comprises 1 probe for every 60 to 250 bp of the locus or loci represented on the 
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array. 

26. Use of an essential or conditional essential gene identified by a method 
according to any one of claims 1 1 to 25, or a polypeptide encoded by a said gene> in 

5 a method for identifying an inhibitor of transcription and/or translation of that gene 
and/or activity of a«polypeptide encoded that gene. 

27. ■ A method for identifying an inhibitor of transcription and/or translation of an 
essential or conditional essential gene and/or an inhibitor of activity of a polypeptide 

10 encoded by a said gene, which method comprises: 

(a) identifying an essential or conditional essential gene by a method 
according to any one of claims 1 1 to 25; and 

(b) deter mining whether a test substance can inhibit transcription and/ or 
translation of a gene identified in step (a) and/or activity of a polypeptide encoded by 

15 a said identified gene, thereby to identify a said inhibitor. 

28. An inhibitor identified by a method according to claim 27. 

29. An inhibitor according to claim 28 for use in a method of treatment of a 

20 bacterial, fungal or eukaryotic parasite infection, wherein the essential or conditional 
essential gene in claim 27 is a bacterial, fungal or eukaryotic parasite essential or 
* conditional essential gene. 

• 30. Use of an inhibitor according to claim 28 wherein the essential or conditional. 
25 essential gene in claim 27 is a bacterial, fungal or eukaryotic parasite essential or 
conditional essential gene, in the manufacture of a medicament for usein the 
treatment of a bacterial, fungal or eukaryotic parasite infection. 

31. A pharmaceutical composition comprising an inhibitor according to claim 28 
30 wherein the essential or conditional essential gene in claim 27 is a bacterial, fungal or 
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eukaryotic parasite essential or conditional essential gene and a pharmaceutically 
acceptable carrier or diluent 

r 

32. A method of treating a host suffering from a bacterial, fbngal or eukaryotic 
5 ' parasite infec^on^-whieh-method comprises-the step of administering to~thehost a " 
therapeutically effective amount of an inhibitor according to claim 28 wherein the 
essential or conditional essential gene in claim 27 is a bacterial, fungal or eukaryotic 
parasite essential or conditional essential gene. 

10 33. A method for the preparation of a pharmaceutical composition, which method 
comprises: 

. (a) identifying an inhibitor of transcription and/or translation of an, 
essential or conditional essential gene bf an organism and/or an inhibitor of activity 
of a polypeptide encoded by a said gene by a method according to claim 27 wherein 
15 the essential or conditional essential gene is a bacterial, fungal or eukaryotic parasite- 
essential.or conditional essential gene; and 

(b) formulating the inhibitor thus identified with a pharmaceutically 
acceptable carrier or diluent. 

20 34. A method for treating a host suffering from a bacterial, fungal or eukaryotic 
parasite infection, which method comprises: 

(a) identifying an inhibitor of transcription and/or translation of an 
essential or conditional essential gene of an organism and/or an inhibitor of activity 
of a polypeptide encoded by a said gene by a method according to claim 27 wherein 

25 the essential or conditional essential gene is a bacterial, fungal or eukaryotic parasite 
essential or conditional essential gene; 

(b) formulating the inhibitor thus identified with a pharmaceutically 
acceptable carrier or diluent; and 

(c) administering to the host a therapeutically effective amount of an 
3 0 inhibitor thus formulated. 
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35. An inhibitor according to claim 28, wherein the essential or conditional 
essential gene in claim 27 is a plant bacterial, plant fungal or plant pest essential or 
conditional essential gene. 

5 

36. Use of an inhibitor according to 35 as a plant bactericide, fungicide or 
pesticide. 

37. An inhibitor according to claim 28, wherein essential or conditional essential 
10 gene in claim 27 is a plant essential or conditional essential gene. 

38. Use of an inhibitor according to claim 47 as a herbicide. 

39. A bacterium attenuated by a non-reverting mutation in one or more genes 
1 5 identified by a method according to any one of claims 22 to 25. 

40. A vaccine comprising a bacterium according to claim 39 and a 
pharmaceutically acceptable carrier or diluent. 

20 41. A bacterium according to claim 39 foruse in a method of vaccinating a 
human or animal. 

42. Use of a bacterium according to claim 3 9 for the manufacture of a 
medicament for vaccinating a human or animal. 

25 

43 . A method for raising an immune response in a mammalian host, which 
method comprises the step of a dmini stering to the host a bacterium according to 
claim 39. 



30 



44. A method for preparing an attenuated bacterium, which method comprises: 
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(a) identifying a conditional essential gene in a bacterium by a method 
according to any. one of claims 22 to 25; and 

(b) introducing a non-reverting mutation into a thus-identified conditional 
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45 . A method for the preparation of a vaccine, which method comprises : 

(a) identifying a conditional essential gene in a bacterium by a method 
according to any one of claims 22 to 25; 



(c) formulating the attenuated bacterium with a pharmaceutically 
acceptable carrier or diluent. 

46. A method for raising an immune response in a mammalian host, which 
15 method comprises: 

(a) identifying a conditional essential gene in a bacterium by a method 
according to any one of claims 22 to 25; 

(b) introducing a non-reverting mutation into a thus-identified conditional 
essential gene of the bacterium, thereby to attenuate the bacterium; 

20 (c) formulating the attenuated bacterium with a pharmaceutically 

acceptable carrier or diluent; and 



essential gene of the bacterium, thereby to attenuate the bacterium. 



10 



(b) introducing a non-reverting mutation into a thus-identified conditional 
essential.gene of the bacterium, thereby to attenuate the bacterium; and 



(d) 



administering to the host the* attenuated bacterium thus formulated. 
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ABSTRACT 

TRANSPOSON 

5 A transposon which comprises an RNA polymerase recognition site and a homing 
endonuciease recognition site. The transposon may be used in methods for the 
identification of an essential gene or a conditional essential gene of an organism. 
Genes identified in such methods are useful as substrates for use in screening for 
antibacterials, antiparasitics, fungicides, pesticides and herbicides. 
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CTGTCTCTTA TACACA rCTUITCTATAGTGTCACXn'AAATAGGGATAACAGGG 

TAATGAATTCGTTAATACAGATGTAGGTGTTCCACAGGGTAGC 

CTGCGATGCAGATCCGGATGGCATITCATTAOCTCITrCTCG 

GATCCGAAGATCAGCAGTTCAACCTGTTGATAGTACGTACrAAGCTCTCATGT 
TTCACGTACTAAGCTCTCATGTITA^ 

AAACGCTCATGGCTAACGTACrAAGCTCTCATGGCTAACGTACTAAGCTCTCAT 

GTITCACGTACTAAGCI UICATG TITGAACAATA 

CTTAAATAGCCTCTAAGGTTTTAAGTTTT^^ 

GCTITrAAAGCTTTrAAGGTTTAACGGTTGTGGACAACAAGCCAG 
CATITCATrACCTCTITCTCCGCACGCGACATAGATCCGGAACATAATGGTGCA 
GGGCGCTGACTrCCGCGTTTCGAGACTTTACGAAACACGGAAACCGAAGACCA 
TTCATGTTGTTGGTCAGGTCGCAGACGTTTTGCAGCAGCAGTCGCTrCACGTr 
. - CGCTCGCGTATCGGTGATTCATrCTGCTAAOCAGTAAGGCAACC^ 
TAGCCGGGTCCTCAAGGACAGGAGCACGATCATGGGCAGCCGTGGGGAGGAC 
CGAACGCTGCCCGAGATGCGCCGCGTGCGGCTGCTGGAGATGGCGGACGCG 
ATGGATATGTTCTGCCAAGGGTTGGTTTGCGCATTCACAGGGTGTCTCAAAAT 
CTCTGATGTTAGATrGCACAAGATAAAAATATATCATGATGAACAATAAAACTG 
TCTGCTTACATAAACAGTAATAGAAGGGGTGTTATGAGCCATATTCAACGGGA 
AACGTCTTGCTCGAGGCCGCGATTAAATTCCAACATGGATGCTGATTTATATG 
GGTATAAATGGGCTGGCGATAATGTCGGGCAATCAGGTGCGACAAIGTATCG' 
ATTGTATGGGAAGCGGGATGCGCGAGAGTTGTTTCTGAAACATGGCAAAGGT 
AGCGTTGCCAATGATGTrACAGATGAGATGGTCAGACTAAACTGGCTGACGG 
AATITATGCCTCTrCCGACCATCAAGCATTrTATCCGTACTCCTGATGATGGAT . 
GGTTACTCACCACTGCGATCCCCGGAAAAACAGCATrCCAGGTATTAGAAGAA 
TATCCTGATTCAGGTGAAAATATrGTTGATGCGCTGGCAGTGTTCCTGCGCCG 
GTTGCATrCGATrCCTGTrrGTAATTGTCCTITrAACAGCGATCGCGTATlTCG 
TCTCGCTCAGGCGCAATCACGAATGAATAACGGTTrGGTTGATGCGAGTGATT 
TTGATGACGAGCGTAATGGCTGGCCTGTrGAACAAGTCTGGAAAGAAATGCA 
TAAACTITrGGCATTCTCACCGGATrCAGTCGTCACT 

TGATAACCTTA jl IT 1 1 GACGAGGGGAAATTAATAGGTTGTATTGATGTTGGAC 

GAGTCGGAATCGCAGACCGATACCAGGATCITGCCATCCTATGGAACTGCCTC 

GGTGAGTTITCrCCrrCATTACAGAAACGGCTriTTCAAAAATATGGTATTGAT 

AATGCTGATATGAATAAATTGCAGTTTCATTTGATGCTCGATGAGTTTT^ 

TCAGAATTGGTTAATTGGTTGTAACACTGGCAGAGGATTACGCTGACTTGACG 

GGACGGCGGCTTTGTTGAATAAATCGAACITTTGCTGAGTTGAAGGATCAGA 

TCACGGATCITCCCGACAACGCAGACCGTTCCGTGGCAAAGCAAAAGTTGAAA 

ATCACCAACTGGTCCAGCTACAACAAAGCTCrCATCAACCGTGGCGGGGATCC 

TCTAGAGTCGA CTGGCAAACAGCTATTATGGGTATTATGGGT AATAGGACTCA 

CTATAGGGAG4 TGTGTA TAA GAGACAG 
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Transposon insertions tnto SL1344 superimposed onto the LT2 genome 
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